Abstract. For $1 < p < \infty$, a weight $w \in A_p$, and any $L^2$-bounded Calderón-Zygmund operator $T$, we show that there is a constant $C_{T,p}$ so that we have the weak and strong type inequalities

\[ \|T_\natural f\|_{L^{p,\infty}(w)} \le C_{T,p}\, \|w\|_{A_p}\, \|f\|_{L^p(w)}, \]
\[ \|T_\natural f\|_{L^{p}(w)} \le C_{T,p}\, \|w\|_{A_p}^{\max\{1,(p-1)^{-1}\}}\, \|f\|_{L^p(w)}, \]

where $T_\natural$ denotes the maximal truncations of $T$, $w$ is a weight, and $\|w\|_{A_p}$ denotes the Muckenhoupt $A_p$ characteristic of $w$. These estimates are not improvable in the power of $\|w\|_{A_p}$. Our argument follows the outlines of the arguments of Lacey-Petermichl-Reguera (Math. Ann. 2010) and Hytönen-Pérez-Treil-Volberg (arXiv, 2010) with new ingredients, including a weak-type estimate for certain duals of $T_\natural$, and sufficient conditions for two weight inequalities in $L^p$ for $T_\natural$. Our proof does not rely upon extrapolation.
1. Overview and Introduction

Our subject is weighted inequalities for maximal truncations $T_\natural$ of Calderón-Zygmund operators. There are two main results. First, we prove weak and strong norm estimates on $T_\natural$ on $L^p(w)$ that are sharp in the $A_p$ characteristic of the weight $w$. In the generality of this paper, this was only known for the untruncated operators, a question investigated by many, culminating in the definitive result in [7].

Second, for dyadic Calderón-Zygmund operators, termed Haar Shift operators, we prove sufficient conditions for the weak and strong type two-weight inequalities for $T_\natural$. These estimates are effective in terms of a notion of complexity for the Haar shift, and while providing only sufficient conditions, are sharp enough in the $A_p$ setting that we can conclude our first result from them.

We recall definitions. 

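Definition 1.1. A Calderón-Zygmund operator $T$ in $\mathbb{R}^d$ is an $L^2(\mathbb{R}^d)$-bounded integral operator with kernel $K(x, y)$, defined by the expression

\[ \langle Tf, g\rangle = \iint f(x)\, g(y)\, K(x,y)\, dx\, dy \]

for all continuous compactly supported functions $f, g$ with $\operatorname{dist}(\operatorname{supp}(f), \operatorname{supp}(g)) > 0$. The kernel $K(x,y)$ satisfies the following growth and smoothness conditions for $x, x', y \in \mathbb{R}^d$:

\[ |K(x,y)| \le \frac{C_T}{|x-y|^{d}}, \qquad x, y \in \mathbb{R}^d,\ x \ne y, \]

\[ |K(x,y) - K(x',y)| + |K(y,x) - K(y,x')| \le C_T \frac{|x-x'|^{\delta}}{|x-y|^{d+\delta}}, \qquad |x-x'| < |x-y|/2. \]

Here, $C_T$ is an absolute constant and $\delta > 0$ is the smoothness parameter of $T$. We denote the maximal truncations of $T$ by

\[ T_\natural f(x) := \sup_{0<\epsilon<\upsilon} \Big| \int_{\epsilon<|x-y|<\upsilon} f(y)\, K(x,y)\, dy \Big|. \]

It is well-known that $T$ and $T_\natural$ extend to bounded operators on $L^p(\mathbb{R}^d)$, for $1 < p < \infty$.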

Prominent examples include the Hilbert and Beurling transforms, as well as the vector $R$ of Riesz transforms. If $w$ is a weight on $\mathbb{R}^d$, namely a non-negative measure with density, also denoted by $w$, that is non-negative almost everywhere, it is well-known [6] that $R$ is bounded on $L^p(w)$, $1 < p < \infty$, if and only if $w$ satisfies the famous Muckenhoupt $A_p$ condition

\[ \|w\|_{A_p} := \sup_{Q} |Q|^{-1} \int_Q w(dx) \Big[ |Q|^{-1} \int_Q \sigma(dx) \Big]^{p-1} < \infty, \]

where $\sigma$ is the weight with density $w^{-1/(p-1)}$, which is dual to $w$. Note that $\|w\|_{A_p}$ is certainly not a norm.
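
To make the $A_p$ characteristic concrete, here is a minimal numerical sketch, not part of the original argument. It assumes a weight sampled on $2^n$ equal cells of $[0,1)$ and takes the supremum only over the dyadic intervals of that grid, so it yields a lower bound for the true characteristic. The function name `dyadic_ap_characteristic` and the sampling scheme are illustrative choices.

```python
def dyadic_ap_characteristic(w, p):
    """Sup over dyadic intervals I of <w>_I * <sigma>_I**(p-1), sigma = w**(-1/(p-1)).

    `w` holds 2**n positive samples of the weight on [0, 1). Only dyadic
    intervals of this sampled grid are tested, so the result is a lower
    bound for the continuous A_p characteristic.
    """
    n = len(w)
    assert n > 0 and n & (n - 1) == 0, "length must be a power of two"
    sigma = [x ** (-1.0 / (p - 1.0)) for x in w]  # dual weight
    best = 0.0
    length = n
    while length >= 1:
        for start in range(0, n, length):
            avg_w = sum(w[start:start + length]) / length
            avg_s = sum(sigma[start:start + length]) / length
            best = max(best, avg_w * avg_s ** (p - 1.0))
        length //= 2
    return best
```

For a constant weight every average product equals 1, while a weight that jumps from 1 to 4 at the midpoint gives $2.5 \cdot 0.625 = 1.5625$ for $p = 2$, attained on the whole interval.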

On the other hand, determining the sharp dependence of Calderón-Zygmund operators on the quantity $\|w\|_{A_p}$ is not straightforward, as first pointed out by Buckley [2]. This direction has been intensively studied in recent years, with the sharp result for $T$ established in [7], following the contributions of several authors. We refer the reader to the introductions of [4, 7, 11, 22] for more information about the history and range of techniques brought to bear on this problem.

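Our first main result is the following theorem.
Theorem 1.2. For $T$ an $L^2(\mathbb{R}^d)$-bounded Calderón-Zygmund operator,

\[ \|T_\natural f\|_{L^{p,\infty}(w)} \le C_T\, \|w\|_{A_p}\, \|f\|_{L^p(w)}, \qquad 1 < p < 2, \]
\[ \|T_\natural f\|_{L^{p}(w)} \le C_T\, \|w\|_{A_p}^{\max\{1,(p-1)^{-1}\}}\, \|f\|_{L^p(w)}, \qquad 1 < p < \infty. \]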

Well-known examples involving power weights (see the conclusion of [22]) show that all the estimates above are sharp. Indeed, these bounds match the best possible bounds for the untruncated operator $T$. The weak-type estimate was conjectured by Andrei Lerner [18], who also conjectured that the maximal truncations should have the same behavior in the $A_p$ characteristic as the untruncated operators (personal communication). As far as we are aware, this is the first place in which the sharp estimates for $T_\natural$ have been established, and the weak-type inequality is new even for untruncated $T$.

We move to a discussion of the proof strategy for this Theorem. We will follow the outlines 
of the argument of [11], but the underlying details are substantially different. The strategy 
is summarized in Figure 1, and has the following points. 

We begin with a Calderón-Zygmund Operator $T$, and the important step, identified in [7], is to write $T$ as a rapidly convergent sum of Haar Shift Operators $\mathbb{S}_{m,n}$. See Definition 2.3 and Theorem 2.5.

Haar Shift Operators are themselves dyadic variants of Calderón-Zygmund Operators, and come with an essential notion of complexity, which is the measure of how many inter-related dyadic scales the operator has. As Calderón-Zygmund Operators, they satisfy many estimates already, but it is a vital point that in order to use the fact that $T$ is a rapidly convergent sum of these operators, all relevant estimates must be shown to be at most polynomial in complexity. We will refer to this as an effective estimate. This requires that we revisit most facts about these operators, and verify that they meet this requirement.

The next crucial stage, the most complicated part of this argument, is to prove reasonably sharp two weight inequalities for Haar Shift Operators. The import here is that to prove our theorem, much of the argument must work in the generality of the two weight setting for a dyadic Calderón-Zygmund Operator. That the weight is in $A_p$ is a fact that can only be used very sparingly. In this, we are following the pattern of [7-9, 12].

All of these prior works depended upon two-weight inequalities for the untruncated operator, and only in $L^2$. Here, we are concerned with two-weight inequalities for the maximal truncations; these estimates will apply in all $L^p$ spaces, an important point as concerns the weak-type inequality. These estimates are taken up in §4, with the weak-type estimate being simple, and the strong type estimate being the most complicated estimate. Different variants of this argument have been used in [8, 9, 14, 15], with the point here being that the estimates in §4 track complexity. See that section for more history on these estimates.

The essential consequence of the two-weight inequalities is that they reduce the question of estimating the norm of $T$ to that of testing the norm on a much simpler class of functions, namely weighted indicators of intervals. These conditions are in turn verified by using a chain of arguments that begins with the verification of certain weak-$L^1$ inequalities for the Haar Shift Operators. We need these weak-type bounds for the adjoints of all linearizations of the maximal truncation operator. This is an estimate not of a classical nature, and is taken up in §9.

This weak-integrability has a certain measure of uniformity. This permits the use of a John- 
Nirenberg Inequality that shows that uniform weak-integrability actually implies exponential 
integrability. This principle, again needed for certain maximal truncations, is formalized in 
§10. 

In order to apply the John-Nirenberg Inequality, with the weight $w$ fixed, we should decompose the collection of dyadic cubes into a Corona Decomposition. As we work with Haar Shifts, a decomposition of the cubes leads automatically to the decomposition of the operator $\mathbb{S}$. This leads to a decomposition of $\mathbb{S}(w 1_E)$ into terms which are individually very nicely behaved.

Finally, the testing conditions can be verified, and using the exponential integrability from 
the Corona Decomposition, one can give a simple verification of these conditions. This part 
of the argument is new to this paper. This argument will not appeal to extrapolation, a 
common technique in this subject. Indeed, the weak type estimate we prove does not seem 
to lend itself to extrapolation. 

In the ultimate section, we provide some variations and consequences of Theorem 1.2. 
2. Haar Shift Operators



In this section, we introduce fundamental dyadic approximations of Calderón-Zygmund operators, the Haar shifts, and make a detailed reduction of the Main Theorem 1.2 to a similar statement, Theorem 2.10, in this dyadic model.

Definition 2.1. A dyadic grid is a collection $\mathcal{D}$ of cubes so that for each $Q \in \mathcal{D}$ we have that

(1) The set of cubes $\{Q' \in \mathcal{D} : |Q'| = |Q|\}$ partitions $\mathbb{R}^d$, ignoring overlapping boundaries of cubes.

(2) $Q$ is a union of cubes in a collection $\operatorname{Child}(Q) \subset \mathcal{D}$, called the children of $Q$. There are $2^d$ children of $Q$, each of volume $|Q'| = 2^{-d}|Q|$.

We refer to any subset of a dyadic grid as simply a grid.

The standard choice for $\mathcal{D}$ consists of the cubes $2^{k} \prod_{s=1}^{d} [n_s, n_s + 1)$, $k, n_1, \dots, n_d \in \mathbb{Z}$. However, the main result of this section, Theorem 2.5, depends upon a random family of dyadic grids.
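
The two defining properties of a dyadic grid can be checked mechanically on the standard grid. The following is a small sketch, assuming a cube $2^k\prod_s [n_s, n_s+1)$ is encoded as the pair `(k, (n_1, ..., n_d))`. The encoding and the helper names `children`, `parent` and `contains` are illustrative and do not come from the paper.

```python
from itertools import product

def children(cube):
    """Return the 2**d dyadic children of cube = (k, n), i.e. 2**k * prod_s [n_s, n_s + 1)."""
    k, n = cube
    return [(k - 1, tuple(2 * ns + b for ns, b in zip(n, bits)))
            for bits in product((0, 1), repeat=len(n))]

def parent(cube):
    """Return the unique dyadic cube of twice the side length containing `cube`."""
    k, n = cube
    return (k + 1, tuple(ns // 2 for ns in n))  # floor division also handles negative indices

def contains(cube, x):
    """Check whether the point x lies in the half-open cube."""
    k, n = cube
    side = 2.0 ** k
    return all(side * ns <= xs < side * (ns + 1) for ns, xs in zip(n, x))
```

Each cube has $2^d$ children, every child has the original cube as its parent, and any point of the cube lies in exactly one child, which is the partition property at the next finer scale.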

In higher dimensions, we mention that the martingale differences are finite rank projections, but there is no canonical choice of the Haar functions in this case. We make the following definition.

Definition 2.2. Let $Q$ be a dyadic cube. A generalized Haar function associated to $Q$, denoted $h_Q$, is a linear combination of the indicator functions $\{1_Q\} \cup \{1_{Q'} : Q' \text{ is a child of } Q\}$,

\[ h_Q = \sum_{Q' \in \operatorname{Child}(Q)} c_{Q'} 1_{Q'}. \]

We say $h_Q$ is a Haar function if in addition $\int h_Q = 0$, that is, a Haar function is orthogonal to constants on its support.

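Definition 2.3. For integers $(m,n) \in \mathbb{Z}_+^2$, we say that a linear operator $\mathbb{S}$ is a (generalized) Haar shift operator of complexity type $(m, n)$ if

\[ \mathbb{S} f(x) = \sum_{Q \in \mathcal{D}} \mathbb{S}_Q f(x) = \sum_{Q \in \mathcal{D}} \ \sideset{}{^{(m,n)}}\sum_{Q', R' \subset Q} \frac{\langle f, h^{Q'}_{R'} \rangle}{|Q|}\, k^{R'}_{Q'}(x), \]

where here and throughout $\ell(Q) = |Q|^{1/d}$ is the side length of the cube $Q$, and

• in the second sum, the superscript $(m,n)$ on the sum means that in addition we require $\ell(Q') = 2^{-m}\ell(Q)$ and $\ell(R') = 2^{-n}\ell(Q)$, and

• the function $h^{Q'}_{R'}$ is a (generalized) Haar function on $R'$, and $k^{R'}_{Q'}$ is one on $Q'$, with the normalization that

\[ \|h^{Q'}_{R'}\|_\infty \le 1, \qquad \|k^{R'}_{Q'}\|_\infty \le 1. \]

We say that the complexity of $\mathbb{S}$ is $\kappa := \max(m, n, 1)$.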
A generalized Haar shift thus has the form

\[ \mathbb{S} f(x) = \sum_{Q \in \mathcal{D}} \frac{1}{|Q|} \int_Q s_Q(x,y) f(y)\, dy = \int K_{\mathbb{S}}(x,y) f(y)\, dy, \]

where $s_Q$, the kernel of the component $\mathbb{S}_Q$, is supported on $Q \times Q$ and $\|s_Q\|_\infty \le 1$. It is easy to check that

\[ |K_{\mathbb{S}}(x,y)| \lesssim \frac{1}{|x-y|^{d}}. \]

The Haar shifts are automatically bounded on $L^2$ with $\|\mathbb{S} f\|_{L^2} \le \|f\|_{L^2}$. This follows from the imposed normalizations and simple orthogonality considerations. It is clear that all restricted shifts $\mathbb{S}_{\mathcal{Q}} f = \sum_{Q \in \mathcal{Q}} \mathbb{S}_Q f$ are also Haar shifts, and hence uniformly bounded on $L^2$ for any $\mathcal{Q} \subset \mathcal{D}$. We will extensively exploit these restricted shifts in the argument.

The generalized Haar shifts are only of relevance to us in two particular special cases of complexity type $(0,0)$, where $\mathbb{S}_Q f = |Q|^{-1} \langle f, h^Q_Q \rangle k^Q_Q =: |Q|^{-1} \langle f, h_Q \rangle k_Q$. These are the paraproduct, where $h_Q = 1_Q$ and $k_Q$ is a Haar function for all $Q$, and the dual paraproduct, where $h_Q$ is a Haar function and $k_Q = 1_Q$ for all $Q$.

It is well-known that the (normalized) boundedness of a (dual) paraproduct is equivalent to the Carleson condition

\[ \sum_{Q \subset R} \|k_Q\|_{L^2}^2 \le |R| \qquad \Big( \sum_{Q \subset R} \|h_Q\|_{L^2}^2 \le |R| \Big) \qquad \forall R \in \mathcal{D}. \]

These conditions are also uniformly inherited by all restricted (dual) paraproducts $\mathbb{S}_{\mathcal{Q}}$. Note that for both Haar shifts and the paraproduct, we have

\[ \int \mathbb{S}_Q f = 0 \qquad \forall Q \in \mathcal{D}, \]

an important cancellation property in some of the later arguments. This is not the case for the dual paraproduct, for which a separate case study is needed at some points.

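Remark 2.4. Let

\[ \delta(x, y) = \min\{\ell(Q) : x, y \in Q \in \mathcal{D}\} \]

be the dyadic distance between $x$ and $y$. The kernel $K_{\mathbb{S}}(x,y)$ of $\mathbb{S}$ satisfies the size and smoothness conditions for a dyadic Calderón-Zygmund kernel:

\[ |K_{\mathbb{S}}(x,y)| \le \frac{1}{\delta(x,y)^{d}}, \]

\[ |K_{\mathbb{S}}(x,y) - K_{\mathbb{S}}(x',y)| = 0 \quad \text{if } \frac{\delta(x,x')}{\delta(x,y)} < \frac{1}{2^{m}}, \]

\[ |K_{\mathbb{S}}(x,y) - K_{\mathbb{S}}(x,y')| = 0 \quad \text{if } \frac{\delta(y,y')}{\delta(x,y)} < \frac{1}{2^{n}}. \]

This is a more general dyadic kernel condition than one studied in [1], called perfect dyadic, which corresponds to $m = n = 0$ in our framework.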
The relevance of Haar shifts to Classical Analysis is explained by the following Theorem, one of the main results of [11] (see [11, Theorem 4.1]; also [7, Theorem 4.2]). This Theorem must be formulated in terms of random dyadic grids. But the nature of this construction of grids is immaterial to the arguments of this paper, and we refer the reader to these references for proofs, history, and further discussion of this result.

Theorem 2.5. There is a collection of random dyadic grids $\{\mathcal{D}^\beta : \beta \in \boldsymbol{\beta}\}$, with expectation operator $\mathbb{E}_\beta$, for which the following holds. Let $T$ be a Calderón-Zygmund Operator with smoothness parameter $\delta$. Then, for all bounded and compactly supported functions $f$ and $g$, we can write

\[ \langle Tf, g \rangle = C\, \mathbb{E}_\beta \sum_{(m,n) \in \mathbb{Z}_+^2} 2^{-(m+n)\delta/2} \langle \mathbb{S}^\beta_{m,n} f, g \rangle, \]

where

• $\mathbb{S}^\beta_{m,n}$ is a Haar shift of complexity type $(m,n)$ for all $(m, n) \in \mathbb{Z}_+^2 \setminus \{(0, 0)\}$;

• $\mathbb{S}^\beta_{0,0}$ is the sum of a Haar shift of type $(0, 0)$, a paraproduct, and a dual paraproduct;

• the constant $C$ is a function of $T$, and of the smoothness parameter $\delta$.

In particular we have the uniform estimate $\|\mathbb{S}^\beta_{m,n}\|_{L^2 \to L^2} \lesssim 1$.

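We define the maximal truncations of a Haar Shift as follows.

Definition 2.6. Suppose that $\mathbb{S}$ is a generalized Haar shift. Define the associated maximal truncations by

\[ \mathbb{S}_\natural f(x) = \sup_{0 < \epsilon \le \upsilon < \infty} |\mathbb{S}_{\epsilon,\upsilon} f(x)|, \]

\[ \mathbb{S}_{\epsilon,\upsilon} f := \sum_{Q \in \mathcal{D} :\ \epsilon \le \ell(Q) \le \upsilon} \mathbb{S}_Q f. \tag{2.7} \]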

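Proposition 2.8. We have the pointwise bound

\[ T_\natural f(x) \le C(T)\, \mathbb{E}_\beta \sum_{(m,n) \in \mathbb{Z}_+^2} 2^{-(m+n)\delta/2} (\mathbb{S}^\beta_{m,n})_\natural f(x) + C(T)\, M f(x), \]

where $M$ is the Hardy-Littlewood maximal operator.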
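Proof. Theorem 2.5 says that

\[ \langle Tf, g \rangle = C(T)\, \mathbb{E}_\beta \sum_{(m,n) \in \mathbb{Z}_+^2} 2^{-(m+n)\delta/2} \langle \mathbb{S}^\beta_{m,n} f, g \rangle \]

for bounded and compactly supported functions $f$ and $g$. By choosing $f$ and $g$ to be normalized indicators of balls of radius $\epsilon$ centred at $y$ and $x$, respectively, and taking the limit as $\epsilon \to 0$, dominated convergence implies the pointwise identity

\[ K(x,y) = C(T)\, \mathbb{E}_\beta \sum_{m,n} 2^{-(m+n)\delta/2} K^\beta_{m,n}(x,y) = C(T)\, \mathbb{E}_\beta \sum_{m,n} 2^{-(m+n)\delta/2} \sum_{Q \in \mathcal{D}^\beta} \frac{s^{m,n}_Q(x,y)}{|Q|}. \]

For $\epsilon > 0$, this implies by Fubini's theorem that

\[ \int_{|x-y|>\epsilon} K(x,y) f(y)\, dy = C(T)\, \mathbb{E}_\beta \sum_{m,n} 2^{-(m+n)\delta/2} \sum_{Q \in \mathcal{D}^\beta} \frac{1}{|Q|} \int_{|x-y|>\epsilon} s^{m,n}_Q(x,y) f(y)\, dy. \]

Let us then decompose

\[ \sum_{Q} \frac{1}{|Q|} \int_{|x-y|>\epsilon} s^{m,n}_Q(x,y) f(y)\, dy = \sum_{\ell(Q) > \epsilon'} \frac{1}{|Q|} \int s^{m,n}_Q(x,y) f(y)\, dy - \sum_{\ell(Q) > \epsilon'} \frac{1}{|Q|} \int_{|x-y| \le \epsilon} s^{m,n}_Q(x,y) f(y)\, dy + \sum_{\ell(Q) \le \epsilon'} \frac{1}{|Q|} \int_{|x-y|>\epsilon} s^{m,n}_Q(x,y) f(y)\, dy \]
\[ = I + II + III, \]

where $\epsilon' = \epsilon/(2\sqrt{d})$. By definition, there holds $|I| \le (\mathbb{S}^\beta_{m,n})_\natural f(x)$. There also holds

\[ |II| \le \sum_{k :\ 2^k > \epsilon'} 2^{-kd} \int_{B(x,\epsilon)} |f| \lesssim \epsilon^{d} \Big( \sum_{k :\ 2^k > \epsilon'} 2^{-kd} \Big) M f(x) \lesssim M f(x). \]

Furthermore, we actually have $III = 0$, since there we must have $|x - y| \le \operatorname{diam}(Q) = \sqrt{d}\, \ell(Q) \le \sqrt{d}\, \epsilon' = \epsilon/2 < \epsilon$. We have thus shown that

\[ \Big| \int_{|x-y|>\epsilon} K(x,y) f(y)\, dy \Big| \le C(T)\, \mathbb{E}_\beta \sum_{(m,n)} 2^{-(m+n)\delta/2} (\mathbb{S}^\beta_{m,n})_\natural f(x) + C(T)\, M f(x) \]

for every $\epsilon > 0$, and from this the proposition follows. □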
Proposition 2.8 and Buckley's [2] sharp weighted bounds for the maximal operator,

\[ \|Mf\|_{L^{p,\infty}(w)} \lesssim \|w\|_{A_p}^{1/p} \|f\|_{L^p(w)}, \qquad \|Mf\|_{L^{p}(w)} \lesssim \|w\|_{A_p}^{1/(p-1)} \|f\|_{L^p(w)}, \tag{2.9} \]

reduce the proof of the Main Theorem 1.2 to the verification of the following dyadic variant, a task which occupies the rest of this paper.
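Theorem 2.10. Let $\mathbb{S}$ be a Haar shift operator with complexity $\kappa$, a paraproduct, or a dual paraproduct. For $1 < p < \infty$ and $w \in A_p$, we then have the estimates

\[ \|\mathbb{S}_\natural f\|_{L^{p,\infty}(w)} \lesssim \kappa\, \|w\|_{A_p}\, \|f\|_{L^p(w)}, \tag{2.11} \]

\[ \|\mathbb{S}_\natural f\|_{L^{p}(w)} \lesssim \kappa\, \|w\|_{A_p}^{\max\{1,(p-1)^{-1}\}}\, \|f\|_{L^p(w)}. \tag{2.12} \]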

Indeed, any polynomial dependence on the complexity parameter $\kappa$ would suffice for Theorem 1.2, but a careful tracing of the constants will even provide the linear dependence, as stated. Even for the untruncated shifts $\mathbb{S}$ in $L^2(w)$, this improves on the quadratic in $\kappa$ bound established in [11] (but we are not aware of an application where this precision in the dependence on $\kappa$ would be of importance).

The dependence on $\kappa$ and the weight constant $\|w\|_{A_p}$ arises from the following points of the proof below: First, we establish two-weight inequalities of the form

\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p,\infty}(w)} \lesssim (\kappa\, \mathfrak{M}_{p,\mathrm{weak}} + \mathfrak{T}_p)\, \|f\|_{L^p(\sigma)}, \]
\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p}(w)} \lesssim (\kappa\, \mathfrak{M}_{p} + \mathfrak{T}_p + \mathfrak{N}_p)\, \|f\|_{L^p(\sigma)}, \]

where $\mathfrak{M}_{p,\mathrm{weak}}$ and $\mathfrak{M}_p$ are the best constants from certain maximal inequalities, while $\mathfrak{T}_p$ and $\mathfrak{N}_p$ are the best constants from appropriate testing conditions for the operator $\mathbb{S}_\natural$. Here, $w$ and $\sigma$ are allowed to be an arbitrary pair of weights, with no relation to each other.

Second, we specialize to the one-weight situation with $\sigma = w^{1-p'}$, using a well-known dual-weight formulation of the bounds to be proven, (2.11) and (2.12). We need to estimate the above four constants in this situation. The maximal constants are independent of $\mathbb{S}_\natural$, and thus of $\kappa$, and they satisfy $\mathfrak{M}_{p,\mathrm{weak}} \lesssim \|w\|_{A_p}^{1/p}$ and $\mathfrak{M}_p \lesssim \|w\|_{A_p}^{1/(p-1)}$ by the sharp maximal function inequalities of Buckley (2.9). For the two testing constants related to $\mathbb{S}_\natural$, we obtain bounds that are linear in $\kappa$.

This dependence comes from the fact that the proof of the John-Nirenberg style estimates of (10.13) requires separating the scales of $\mathbb{S}$ by dividing it into $\kappa + 1$ parts, each of which contains nonzero components $\mathbb{S}_Q$ only for a fixed value of $\log_2 \ell(Q) \bmod (\kappa + 1)$. For these separated parts of $\mathbb{S}$, our bounds will be independent of $\kappa$, and it remains to sum up.
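
The separation of scales described above is purely combinatorial and can be sketched in a few lines. The sketch assumes the components of the shift are stored in a dictionary keyed by $\log_2 \ell(Q)$. The function name `separate_scales` and this storage format are hypothetical and serve only to illustrate the grouping by residue class.

```python
def separate_scales(cubes_by_log_side, kappa):
    """Split {log2 side length: [cubes]} into kappa + 1 families,
    one for each residue of log2(side length) modulo kappa + 1."""
    families = [dict() for _ in range(kappa + 1)]
    for j, cubes in cubes_by_log_side.items():
        # Python's % returns a value in [0, kappa] even for negative j.
        families[j % (kappa + 1)][j] = cubes
    return families
```

Within each family, two distinct scales differ by a multiple of $\kappa + 1$, so side lengths in one family are separated by a factor of at least $2^{\kappa+1}$.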

3. Linearizing Maximal Operators 

A fundamental tool is derived from (the usual) general maximal function estimates that hold for any measure. In particular, for a weight $w$ we define

\[ M_w f(x) := \sup_{Q \in \mathcal{D}} 1_Q(x)\, \mathbb{E}^w_Q |f|, \qquad \mathbb{E}^w_Q f := w(Q)^{-1} \int_Q f\, w(dx). \]

The notation $\mathbb{E}_Q f$ means that the implied measure is Lebesgue. It is a basic fact, proved by exactly the same methods that prove the non-weighted inequality, that we have the estimate below, which will be used repeatedly.
Theorem 3.1. We have the inequalities

\[ \|M_w f\|_{L^p(w)} \lesssim \|f\|_{L^p(w)}, \qquad 1 < p < \infty. \tag{3.2} \]

We use the method of linearizing maximal operators. This is familiar in the context of the maximal function, and we make a comment about it here. Let $\{E(Q) : Q \in \mathcal{D}\}$ be any selection of measurable disjoint sets $E(Q) \subset Q$ indexed by the dyadic cubes. Define a corresponding linear operator by

\[ f \mapsto \sum_{Q \in \mathcal{D}} 1_{E(Q)}\, \mathbb{E}^w_Q f. \]

Then, the universal maximal function bound (3.2) is equivalent to the bound $\big\| \sum_{Q \in \mathcal{D}} 1_{E(Q)} \mathbb{E}^w_Q f \big\|_{L^p(w)} \lesssim \|f\|_{L^p(w)}$ with implied constant independent of $w$ and the sets $\{E(Q) : Q \in \mathcal{D}\}$. This estimate will be used repeatedly below.
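
The linearization of the maximal function can be illustrated on a finite grid. The sketch below assumes $f$ and $w$ are sampled on $2^n$ equal cells of $[0,1)$ and considers only dyadic intervals of that grid. For each cell it records one interval attaining the maximum, which defines disjoint sets $E(Q)$, and it then applies the resulting linear operator. The names `weighted_dyadic_maximal` and `linearized` are illustrative, not taken from the paper.

```python
def weighted_dyadic_maximal(f, w):
    """Return (M, choice): M[i] is the largest w-average of |f| over dyadic
    intervals containing cell i, and choice[i] = (start, length) is one
    interval attaining it (the largest such interval)."""
    n = len(f)
    assert n == len(w) and n > 0 and n & (n - 1) == 0
    M = [0.0] * n
    choice = [None] * n
    length = n
    while length >= 1:
        for start in range(0, n, length):
            cells = range(start, start + length)
            avg = sum(abs(f[i]) * w[i] for i in cells) / sum(w[i] for i in cells)
            for i in cells:
                if choice[i] is None or avg > M[i]:
                    M[i], choice[i] = avg, (start, length)
        length //= 2
    return M, choice

def linearized(g, w, choice):
    """Apply g -> sum_Q 1_{E(Q)} E_Q^w g, where E(Q) = {i : choice[i] == Q}."""
    out = []
    for (start, length) in choice:
        cells = range(start, start + length)
        out.append(sum(g[i] * w[i] for i in cells) / sum(w[i] for i in cells))
    return out
```

With the sets $E(Q)$ selected from $f$ itself, the linear operator applied to $|f|$ reproduces the maximal function exactly, which is the easy half of the equivalence stated above.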

There is a related way to linearize $\mathbb{S}_\natural$, which deserves careful comment as we would like, at different points, to treat $\mathbb{S}_\natural$ as a linear operator. While it is not a linear operator, $\mathbb{S}_\natural$ is a pointwise supremum of the linear truncation operators $\mathbb{S}_{\epsilon,\upsilon}$, and as such, the supremum can be linearized with a measurable selection of the truncation parameters.

Definition 3.3. We say that $L$ is a linearization of $\mathbb{S}_\natural$ if there are measurable functions $\epsilon(x), \upsilon(x) \in (0, \infty)$ and $\vartheta(x) \in [0, 2\pi)$ such that, using (2.7), we have

\[ L f(x) = e^{i\vartheta(x)}\, \mathbb{S}_{\epsilon(x),\upsilon(x)} f(x) \ge 0, \qquad x \in \mathbb{R}^d. \]

Note that the requirement $Lf(x) \ge 0$ defines $\vartheta(x)$ everywhere except when $\mathbb{S}_{\epsilon(x),\upsilon(x)} f(x) = 0$. Also, for fixed $f$ we can always choose a linearization $L$ so that $\mathbb{S}_\natural f(x) \le 2 L f(x)$ for all $x$.

A key advantage of $L$ is that it is a linear operator, and as such it has an adjoint, given by the formal expression

\[ L^* \nu(y) = \sum_{Q \in \mathcal{D}} \frac{1}{|Q|} \int_Q s_Q(x,y)\, 1_{\{\epsilon(x) \le \ell(Q) \le \upsilon(x)\}}\, e^{i\vartheta(x)}\, \nu(dx). \tag{3.4} \]

The following 'smoothness' property of $L^*$ is an important observation in the proof of our two weight estimates.

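Lemma 3.5. Suppose that for a measure $\nu$ and cube $Q_0$ we have $|\nu|(Q_0) = 0$. Suppose that $\mathbb{S}$ has complexity type $(m,n)$. Then $L^*\nu(\cdot)$ is constant on subcubes $Q' \subset Q_0$ with $\ell(Q') \le 2^{-n}\ell(Q_0)$.

Proof. For $y \in Q_0$, the sum in (3.4) defining the adjoint operator becomes

\[ L^* \nu(y) = \sum_{Q \in \mathcal{D} :\ Q \supsetneq Q_0} \frac{1}{|Q|} \int_Q s_Q(x,y)\, 1_{\{\epsilon(x) \le \ell(Q) \le \upsilon(x)\}}\, e^{i\vartheta(x)}\, \nu(dx), \]

since the terms with $Q \subset Q_0$ vanish by the assumption $|\nu|(Q_0) = 0$. As a function of $y$, the kernel $s_Q(x,y)$ is constant on the subcubes $R' \subset Q$ with $\ell(R') \le 2^{-n-1}\ell(Q)$. Thus for $Q \supsetneq Q_0$, it is in particular constant on the subcubes $Q' \subset Q_0$ with $\ell(Q') \le 2^{-n}\ell(Q_0)$. □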

4. The Two Weight Estimates 

We are interested in tracking complexity dependence in two weight inequalities for Haar 
Shift Operators, as defined in §2. We study the maximal truncations of such operators, and 
obtain sufficient conditions for the weak and strong type {p,p) two weight inequalities for 
such operators. Our main results are Theorem 4.3 for the weak type result, and Theorem 4.7 
for the strong type result. These Theorems give sufficient conditions in terms of the Maximal 
Function, and certain testing conditions. Of particular import here is that these sufficient 
conditions are efficient in terms of the complexity of the Haar Shift operator. 

Our primary focus concerns extensions of the dyadic T1 Theorem to the two weight setting. These considerations are motivated in part by a well developed theory of two weight estimates for positive operators. These Theorems have formulations strikingly similar to the T1 Theorem; this theory encompasses the Theorems due to one of us concerning two weight inequalities, both strong and weak type, for the maximal operator [23] and fractional integral operators [24], [25]. There is also the bilinear embedding inequality of Nazarov-Treil-Volberg [21]. We refer the reader to [13] for a discussion of these results.

There is a beautiful result of Nazarov-Treil-Volberg [20], a two-weight version of the T1 theorem. A subcase of their result was proved for Haar Shifts, with an effective bound on complexity, in [11, Theorem 3.4].

Theorem 4.1. Let $\mathbb{S}$ be a Haar Shift operator of complexity $\kappa$, as in Definition 2.3. Let $\sigma, w$ be two positive locally finite measures. We have the inequality

where the three quantities above are defined by 



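\[ \|w, \sigma\|_{A_2} := \sup_Q \frac{w(Q)}{|Q|} \frac{\sigma(Q)}{|Q|}, \]

\[ \mathfrak{T} := \sup_Q \sigma(Q)^{-1/2}\, \|1_Q\, \mathbb{S}(\sigma 1_Q)\|_{L^2(w)}, \]

\[ \mathfrak{T}^* := \sup_Q w(Q)^{-1/2}\, \|1_Q\, \mathbb{S}^*(w 1_Q)\|_{L^2(\sigma)}. \tag{4.2} \]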

The first of the three conditions is the two weight $A_2$ condition; the remaining two are the testing conditions. The proof is fundamentally restricted to the case of $p = 2$, and it does not address maximal truncations. We will consider the case of $1 < p < \infty$ and obtain sufficient conditions for the two weight inequalities for the maximal operator $\mathbb{S}_\natural$. First we give the weak type result. Below, $M$ denotes the maximal function.
Theorem 4.3. Let $\mathbb{S}$ be a generalized Haar shift of complexity $\kappa$ as in Definition 2.3. Then we have the weak type inequality

\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p,\infty}(w)} \lesssim (\kappa\, \mathfrak{M}_{p,\mathrm{weak}} + \mathfrak{T}_p)\, \|f\|_{L^p(\sigma)}, \tag{4.4} \]

where the constants $\mathfrak{M}_{p,\mathrm{weak}}$ and $\mathfrak{T}_p$ are the best such in the following inequalities

\[ \|M(f\sigma)\|_{L^{p,\infty}(w)} \le \mathfrak{M}_{p,\mathrm{weak}}\, \|f\|_{L^p(\sigma)}, \]

\[ \int_Q \mathbb{S}_\natural(\sigma f 1_Q)\, w(dx) \le \mathfrak{T}_p\, \|f\|_{L^p(\sigma)}\, w(Q)^{1/p'}. \tag{4.5} \]

The point of this Theorem is that to check the weak-type inequality for $\mathbb{S}_\natural$, it suffices to check the weak-type inequality for the simpler maximal operator $M$, and to check only particular instances of the weak-type inequality for $\mathbb{S}_\natural$. It is also important that the complexity $\kappa$ appears with polynomial growth.

The dual testing condition (4.5) looks rather complicated, with the appearance of $f \in L^p(\sigma)$ in it. However, $\mathbb{S}_\natural$ appears to just the first power, and it is a close relative of (4.2). Indeed, (4.5) has a more convincing formulation in the linearizations. It is equivalent to the dual testing condition

\[ \|1_Q\, L^*(1_Q g w)\|_{L^{p'}(\sigma)} \le \mathfrak{T}_p\, w(Q)^{1/p'}\, \|g\|_{L^\infty}. \tag{4.6} \]

This holds uniformly over all choices of linearizations, a fact which is referred to repeatedly below. Inequality (4.6) reflects the fact that the dual of a weak type inequality is a restricted strong type inequality.

Our strong type result will require duals $L^*$ of linearizations $L$ of $\mathbb{S}_\natural$ in order to state the nonstandard testing condition in (4.8). These were defined in Section 3 above.

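Theorem 4.7. Let $\mathbb{S}$ be a generalized Haar shift of complexity $\kappa$, as in Definition 2.3. We have the following quantitative estimate:

\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p}(w)} \lesssim (\kappa\, \mathfrak{M}_{p} + \mathfrak{T}_p + \mathfrak{N}_p)\, \|f\|_{L^p(\sigma)}, \]

where $\mathfrak{T}_p$ is defined in (4.5), and the numbers $\mathfrak{M}_p$ and $\mathfrak{N}_p$ are defined in

\[ \|M(f\sigma)\|_{L^{p}(w)} \le \mathfrak{M}_p\, \|f\|_{L^p(\sigma)}, \]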

(4.8) sup sup— ^ / snplgf^— [ \h*{lQipw){y)\a{dy)] w{dx). 

As a new kind of complication compared to the weak-type case, we have the nonstandard testing condition (4.8). Its primary difficulty is the appearance of $|L^*|$ integrated over $Q$ with respect to $\sigma$, but then divided by $w(Q)$ rather than the usual $\sigma(Q)$. Also there is an additional supremum, with the argument of $L^*$ dependent upon the cube $Q$ over which we are taking the supremum.
The method of proof is an extension of Sawyer's approach to the two weight fractional integrals [25], but see also [13]. This argument follows the outlines of the proof in [14], which proves variants of Theorem 4.3 and Theorem 4.7 for smooth Calderón-Zygmund operators. The current arguments are, naturally, much easier while retaining the essential ideas and techniques of [14]. (The reader can also compare the arguments of this paper to those of [13].)

Let us give a guide to the next few sections of this paper, which are concerned with the 
proof of the above two-weight results. 

§5: Collects facts central to the proofs: maximal functions, linearizations of maximal functions, Whitney decompositions, and an important maximum principle.

§6: The weak-type result Theorem 4.3 is proved.

§7: Sufficient conditions for the strong type result are stated; the classical part of the proof of the strong type result Theorem 4.7 is begun.

§8: This section contains the core of the proof of Theorem 4.7.

5. Generalities of the Proof 

5.1. Whitney Decompositions. We make general remarks about the sets

\[ \Omega_k := \{\mathbb{S}_\natural(f\sigma) > 2^k\}, \tag{5.1} \]

where $f$ is a finite linear combination of indicators of dyadic cubes. For points $x$ sufficiently far away from the support of $f$, we will have that $\mathbb{S}_\natural(f\sigma)$ is dominated by the maximal function $M(f\sigma)$. Hence, the sets $\Omega_k$ will be open with compact closure.

Let $Q^{(1)}$ denote the dyadic parent of $Q$, and inductively define $Q^{(j+1)} = (Q^{(j)})^{(1)}$. For a nonnegative integer $\zeta$, let $\mathcal{Q}_k$ be the collection of maximal dyadic cubes $Q$ such that $Q^{(\zeta)} \subset \Omega_k$. Then

\[ \Omega_k = \bigcup_{Q \in \mathcal{Q}_k} Q \quad \text{(disjoint cover)}, \]

\[ Q^{(\zeta)} \subset \Omega_k, \quad Q^{(\zeta+1)} \cap \Omega_k^c \ne \emptyset \quad \text{(Whitney condition)}, \]

\[ Q \in \mathcal{Q}_k,\ Q' \in \mathcal{Q}_l,\ Q \subsetneq Q' \ \text{implies}\ k > l \quad \text{(nested property)}. \]

Remark 5.2. In the proof of the weak type theorem we will take $\zeta = 0$. In the proof of the strong type theorem we will take $\zeta = n + 1 \le \kappa + 1$, when the shift under consideration has complexity type $(m,n)$.
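
For $\zeta = 0$ the Whitney decomposition is simply the family of maximal dyadic cubes inside $\Omega_k$. A one-dimensional sketch on a finite grid follows, assuming the open set is given as a boolean list over $2^n$ equal cells of $[0,1)$. The function name `maximal_dyadic_intervals` and the discretization are illustrative assumptions.

```python
def maximal_dyadic_intervals(indicator):
    """Return the maximal dyadic intervals (start, length) of the grid
    that are contained in the set {i : indicator[i] is True}."""
    n = len(indicator)
    assert n > 0 and n & (n - 1) == 0
    result = []

    def visit(start, length):
        if all(indicator[start:start + length]):
            # Contained in the set; maximal because no ancestor was contained.
            result.append((start, length))
        elif length > 1:
            half = length // 2
            visit(start, half)
            visit(start + half, half)

    visit(0, n)
    return result
```

The returned intervals are pairwise disjoint and cover the set, and the dyadic parent of each one meets the complement (unless the whole grid lies in the set), which is the discrete form of the Whitney condition with $\zeta = 0$.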

5.2. Maximum Principle. A fundamental tool is the use of what we term here a 'maximum principle' (we could also use the term 'good-$\lambda$ technique'): Subject to the assumption that the maximal function $M$ is of small size, we will be able to see that the maximal truncations are large due to the restriction of the function to a local cube. This leads to an essential 'localization' of the singular integrals.
Theorem 5.3 (Maximum Principle). Let $\mathbb{S}$ be a generalized Haar shift of complexity type $(m,n)$. For any cube $Q \in \mathcal{Q}_k$ as in the Whitney decomposition of $\Omega_k$ in (5.1) above with parameter $\zeta$, we have the pointwise inequality, for $x \in Q$,

\[ \mathbb{S}_\natural(f\sigma)(x) \le \sup_{\epsilon \le \upsilon \le \ell(Q)} |\mathbb{S}_{\epsilon,\upsilon}(f\sigma)(x)| + 2^k + (\zeta + n + 1)\, M(f\sigma)(x) \]
\[ \le \mathbb{S}_\natural(f 1_Q \sigma)(x) + 2^k + (\zeta + n + 1)\, M(f\sigma)(x). \tag{5.4} \]

There is a corresponding Maximum Principle in Section 3.3 of [13], which is very effective in the positive operator case. As our operators are not positive, and as we are ultimately only interested in the one weight situation, we have the maximal function $M$ on the right in (5.4).

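Proof. Note that the second inequality of the claim is obvious, since $\mathbb{S}_{\epsilon,\upsilon}(f\sigma)(x) = \mathbb{S}_{\epsilon,\upsilon}(f 1_Q \sigma)(x)$ when $x \in Q$ and $\epsilon \le \upsilon \le \ell(Q)$. We prove the first inequality.
Let $x \in Q \in \mathcal{Q}_k$, and let $r := (\zeta + 1) + (n + 1)$. Then

\[ \mathbb{S}_{\epsilon,\upsilon}(f\sigma)(x) = \sum_{I :\ \epsilon \le \ell(I) \le \upsilon} \mathbb{S}_I(f\sigma)(x) \]

\[ = \Big( \sum_{\substack{I :\ \epsilon \le \ell(I) \le \upsilon \\ \ell(I) \le \ell(Q)}} + \sum_{\substack{I :\ \epsilon \le \ell(I) \le \upsilon \\ \ell(I) \ge 2^{r}\ell(Q)}} + \sum_{\substack{I :\ \epsilon \le \ell(I) \le \upsilon \\ \ell(Q) < \ell(I) < 2^{r}\ell(Q)}} \Big)\, \mathbb{S}_I(f\sigma)(x) \]

\[ = \mathbb{S}_{\epsilon,\min(\upsilon,\ell(Q))}(f\sigma)(x) + \mathbb{S}_{\max(\epsilon, 2^{r}\ell(Q)),\upsilon}(f\sigma)(x) + \sum_{\substack{I :\ \epsilon \le \ell(I) \le \upsilon \\ \ell(Q) < \ell(I) < 2^{r}\ell(Q)}} \mathbb{S}_I(f\sigma)(x). \]

The first term on the right is clearly dominated by $\sup_{\epsilon' \le \upsilon' \le \ell(Q)} |\mathbb{S}_{\epsilon',\upsilon'}(f\sigma)(x)|$. All $\mathbb{S}_I(f\sigma)$ participating in the second term have $\ell(I) \ge 2^{n+1}\ell(Q^{(\zeta+1)})$, so they are constant on $Q^{(\zeta+1)}$. Hence we may replace $x$ by some $x' \in Q^{(\zeta+1)} \setminus \Omega_k$ (which is nonempty by definition of $\mathcal{Q}_k$). So this second term is dominated by

\[ \mathbb{S}_\natural(f\sigma)(x') \le 2^k. \]

Finally, the last term contains at most $r - 1 = \zeta + n + 1$ summands, each of which is dominated by $M(f\sigma)(x)$. □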

6. Proof of the Weak-Type Inequality 
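We prove Theorem 4.3, stating that

\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p,\infty}(w)} \lesssim (\kappa\, \mathfrak{M}_{p,\mathrm{weak}} + \mathfrak{T}_p)\, \|f\|_{L^p(\sigma)}. \]

To this end, we need to estimate the quantities

\[ w(\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k) \le w(M(f\sigma) > \eta 2^k) + w(\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k,\ M(f\sigma) \le \eta 2^k), \]

where the small parameter $\eta$ is to be chosen shortly. Since $\{\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k\} \subset \Omega_k = \bigcup_{Q \in \mathcal{Q}_k} Q$, where we take the Whitney decomposition with parameter $\zeta = 0$, we further have that

\[ w(\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k,\ M(f\sigma) \le \eta 2^k) = \sum_{Q \in \mathcal{Q}_k} w(Q \cap \{\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k,\ M(f\sigma) \le \eta 2^k\}) =: \sum_{Q \in \mathcal{Q}_k} w(E_k(Q)). \]

On $E_k(Q) \subset Q$, the maximum principle gives that

\[ 4 \cdot 2^k < \mathbb{S}_\natural(f\sigma) \le \mathbb{S}_\natural(f 1_Q \sigma) + 2^k + (2n + 3)\, M(f\sigma) \]
\[ \le \mathbb{S}_\natural(f 1_Q \sigma) + 2^k + (2n + 3)\, \eta 2^k \le \mathbb{S}_\natural(f 1_Q \sigma) + 2 \cdot 2^k, \]

provided that $\eta := (2\kappa + 3)^{-1} \le (2n + 3)^{-1}$. Thus

\[ \mathbb{S}_\natural(f \sigma 1_Q) > 2^{k+1} \quad \text{on } E_k(Q). \]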
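Putting these considerations together, we obtain (for another small parameter $\delta > 0$)

\[ (4 \cdot 2^k)^p\, w(\mathbb{S}_\natural(f\sigma) > 4 \cdot 2^k) \le 4^p 2^{kp}\, w(M(f\sigma) > \eta 2^k) + 4^p 2^{kp} \sum_{Q \in \mathcal{Q}_k} w(E_k(Q)) \]

\[ \le 4^p \eta^{-p} \|M(f\sigma)\|^p_{L^{p,\infty}(w)} + 4^p 2^{kp} \sum_{\substack{Q \in \mathcal{Q}_k \\ w(E_k(Q)) < \delta w(Q)}} \delta\, w(Q) + 4^p 2^{kp} \sum_{\substack{Q \in \mathcal{Q}_k \\ w(E_k(Q)) \ge \delta w(Q)}} w(E_k(Q)) \Big[ \frac{1}{w(E_k(Q))} \int_{E_k(Q)} 2^{-k-1}\, \mathbb{S}_\natural(f 1_Q \sigma)\, w \Big]^p \]

\[ \le 4^p \eta^{-p}\, \mathfrak{M}^p_{p,\mathrm{weak}} \|f\|^p_{L^p(\sigma)} + 4^p \delta\, 2^{kp}\, w(\mathbb{S}_\natural(f\sigma) > 2^k) + 2^p \delta^{1-p} \sum_{Q \in \mathcal{Q}_k} w(Q)^{1-p} \Big[ \int_Q \mathbb{S}_\natural(f 1_Q \sigma)\, w \Big]^p. \]

Picking a $k$ for which the left side is close to its supremum, and choosing $\delta = 10^{-p}$, we can absorb the middle term on the right into the left side. Recalling the choice of $\eta$ and applying the testing condition to the integrals in the last term, we obtain

\[ \|\mathbb{S}_\natural(f\sigma)\|^p_{L^{p,\infty}(w)} \lesssim \kappa^p\, \mathfrak{M}^p_{p,\mathrm{weak}} \|f\|^p_{L^p(\sigma)} + \sum_{Q \in \mathcal{Q}_k} w(Q)^{1-p} \big( \mathfrak{T}_p \|f 1_Q\|_{L^p(\sigma)}\, w(Q)^{1/p'} \big)^p \]

\[ \le (\kappa^p\, \mathfrak{M}^p_{p,\mathrm{weak}} + \mathfrak{T}^p_p)\, \|f\|^p_{L^p(\sigma)}, \]

by the disjointness of the $Q \in \mathcal{Q}_k$ in the last step.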
7. First Steps in the Proof of the Strong Type Inequality
We start preparing for the proof of

\[ \|\mathbb{S}_\natural(f\sigma)\|_{L^{p}(w)} \lesssim (\kappa\, \mathfrak{M}_p + \mathfrak{T}_p + \mathfrak{N}_p)\, \|f\|_{L^p(\sigma)}. \]

In this section, we make an estimate of the form

\[ \|\mathbb{S}_\natural(f\sigma)\|^p_{L^{p}(w)} \lesssim \kappa^p\, \mathfrak{M}^p_p \|f\|^p_{L^p(\sigma)} + \delta\, \|\mathbb{S}_\natural(f\sigma)\|^p_{L^p(w)} + \text{remainder}, \]

where the second term with the small parameter $\delta$ may be absorbed into the left-hand side, and the 'remainder' will be controlled in terms of $(\mathfrak{T}_p + \mathfrak{N}_p)\|f\|_{L^p(\sigma)}$ in the following section, which contains the core of the argument.
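We begin with

\[ \|\mathbb{S}_\natural(f\sigma)\|^p_{L^p(w)} \le \sum_{k \in \mathbb{Z}} (2^{k+2})^p\, w(\Omega_{k+1} \setminus \Omega_{k+2}) = 4^p \sum_{k \in \mathbb{Z}} 2^{kp} \sum_{Q \in \mathcal{Q}_k} w(Q \cap (\Omega_{k+1} \setminus \Omega_{k+2})) \]

\[ =: 4^p \sum_{k \in \mathbb{Z}} 2^{kp} \sum_{Q \in \mathcal{Q}_k} w(F_k(Q)). \]

Note that the sets $F_k(Q)$ are pairwise disjoint. We first employ a similar reduction as in the weak-type case:

\[ w(F_k(Q)) \le w(F_k(Q) \cap \{M(f\sigma) > \eta 2^k\}) + w(F_k(Q) \cap \{M(f\sigma) \le \eta 2^k\}) \]
\[ =: w(F_k(Q) \cap \{M(f\sigma) > \eta 2^k\}) + w(E_k(Q)). \]

Then

\[ \sum_{k} 2^{kp} \sum_{Q \in \mathcal{Q}_k} w(F_k(Q) \cap \{M(f\sigma) > \eta 2^k\}) \le \eta^{-p} \sum_{k} \sum_{Q \in \mathcal{Q}_k} \int_{F_k(Q)} M(f\sigma)^p\, w \]

\[ \le \eta^{-p}\, \|M(f\sigma)\|^p_{L^p(w)} \le \eta^{-p}\, \mathfrak{M}^p_p\, \|f\|^p_{L^p(\sigma)}. \]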

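We are left with

\[ \sum_{k \in \mathbb{Z}} 2^{kp} \sum_{Q \in \mathcal{Q}_k} w(E_k(Q)) \le \sum_{k \in \mathbb{Z}} 2^{kp} \sum_{\substack{Q \in \mathcal{Q}_k \\ w(E_k(Q)) < \delta w(Q)}} \delta\, w(Q) + \sum_{k \in \mathbb{Z}} 2^{kp} \sum_{\substack{Q \in \mathcal{Q}_k \\ w(E_k(Q)) \ge \delta w(Q)}} w(E_k(Q)), \]

and the first sum on the right is immediately dominated by

\[ \delta \sum_{k \in \mathbb{Z}} 2^{kp}\, w(\Omega_k) \lesssim \delta\, \|\mathbb{S}_\natural(f\sigma)\|^p_{L^p(w)}, \]

so this term can be absorbed after a suitable small choice of $\delta > 0$ depending only on $p$.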

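Now consider one of the remaining sets $E_k(Q)$ for $Q \in \mathcal{Q}_k$ and $w(E_k(Q)) \ge \delta w(Q)$. By the maximum principle, we can choose a linearization $L = e^{i\vartheta(x)}\, \mathbb{S}_{\epsilon(x),\upsilon(x)}$ of $\mathbb{S}_\natural$ with $\upsilon(x) \le \ell(Q)$, so that, for $x \in E_k(Q)$,

\[ 4 \cdot 2^k < \mathbb{S}_\natural(f\sigma)(x) \le 2 L(f\sigma)(x) + 2^k + (\zeta + n + 1)\, M(f\sigma)(x) \]
\[ \le 2 L(f\sigma)(x) + 2^k + (\zeta + n + 1)\, \eta 2^k \]
\[ \le 2 L(f\sigma)(x) + 2 \cdot 2^k, \]

by choosing

\[ \eta := (\zeta + n + 1)^{-1} = (2n + 2)^{-1} \gtrsim \kappa^{-1}. \]

Hence

\[ L(f\sigma)(x) > 2^k \quad \text{on } E_k(Q). \]

Notice that, by the disjointness of the sets $E_k(Q) \subset F_k(Q)$, we can globally define one linearization $L$, which fulfills this condition on all $E_k(Q)$.
Thus, for $w(E_k(Q)) \ge \delta w(Q)$, we have

\[ 2^{kp}\, w(E_k(Q)) \le w(E_k(Q)) \Big( \frac{1}{w(E_k(Q))} \int_{E_k(Q)} L(f\sigma)\, w \Big)^p \le \delta^{-p}\, w(E_k(Q)) \Big( \frac{1}{w(Q)} \int_{E_k(Q)} L(f\sigma)\, w \Big)^p. \]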

where 

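We have proven that (absorbing the term $\delta\, \|\mathbb{S}_\natural(f\sigma)\|^p_{L^p(w)}$, and using $\eta^{-1} \lesssim \kappa$)

\[ \|\mathbb{S}_\natural(f\sigma)\|^p_{L^p(w)} \lesssim \kappa^p\, \mathfrak{M}^p_p\, \|f\|^p_{L^p(\sigma)} + \sum_{k} \sum_{Q \in \mathcal{Q}_k} w(E_k(Q)) \Big( \frac{1}{w(Q)} \int_{E_k(Q)} L(f\sigma)\, w \Big)^p. \]

(The dependence on $\delta$ has been neglected, since this is in any case a function of $p$ only.)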

8. Strong-type estimates: the core 

We are left to prove that

\[ \sum_{k} \sum_{Q \in \mathcal{Q}_k} w(E_k(Q)) \Big( \frac{1}{w(Q)} \int_{E_k(Q)} L(f\sigma)\, w \Big)^p \lesssim (\mathfrak{T}_p + \mathfrak{N}_p)^p\, \|f\|^p_{L^p(\sigma)}. \tag{8.1} \]

We can make the additional assumption that all $k$ in this sum are of the same parity; after all, there are just two such sums. By monotone convergence, we may also assume that all appearing cubes are contained in some maximal dyadic cube $Q^0$. This allows us to make the following construction:

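Definition 8.2 (Principal cubes). We form the collection $\mathcal{G}$ of principal cubes as follows.
We let $\mathcal{G}_0 := \{Q^0\}$ (the maximal cube that we consider), and inductively

\[
\mathcal{G}_k := \bigcup_{G\in\mathcal{G}_{k-1}} \{G'\subset G : \mathbb{E}^\sigma_{G'}|f| > 4\,\mathbb{E}^\sigma_{G}|f|,\ G' \text{ is a maximal such cube}\},
\]

and then $\mathcal{G} := \bigcup_{k=0}^{\infty}\mathcal{G}_k$. For any dyadic $Q$ ($\subset Q^0$), we let

\[ \Gamma(Q) := \text{the minimal principal cube containing } Q. \]

From the definition it follows that

\[ \mathbb{E}^\sigma_Q|f| \le 4\,\mathbb{E}^\sigma_{\Gamma(Q)}|f|. \]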
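
The inequality just stated follows from the stopping rule by a short contradiction argument, sketched below. The sketch assumes the reading of the symbols used in this section: $\mathbb{E}^\sigma_Q|f| = \sigma(Q)^{-1}\int_Q |f|\,\sigma$ is the $\sigma$-average, and $\Gamma(Q)$ is the minimal principal cube containing $Q$.

```latex
% Claim: E^\sigma_Q|f| \le 4 E^\sigma_{\Gamma(Q)}|f| for every dyadic Q inside the top cube.
Let $G := \Gamma(Q)$. If $Q = G$ there is nothing to prove, so let $Q \subsetneq G$
and suppose, towards a contradiction, that
\[
  \mathbb{E}^\sigma_Q |f| > 4\,\mathbb{E}^\sigma_G |f| .
\]
Then $Q$ is contained in a maximal dyadic cube $G' \subsetneq G$ with
$\mathbb{E}^\sigma_{G'}|f| > 4\,\mathbb{E}^\sigma_G|f|$. By construction $G'$ is a
principal cube, and $Q \subseteq G' \subsetneq G$, which contradicts the
minimality of $G = \Gamma(Q)$. Hence
\[
  \mathbb{E}^\sigma_Q |f| \le 4\,\mathbb{E}^\sigma_{\Gamma(Q)} |f| .
\]
```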

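We begin the analysis of (8.1). Recall that on $E_k(Q)$, we have $L = L(1_Q\,\cdot\,)$. Thus we may
dualize and split the cube $Q$ to the result that

\[
\int_{E_k(Q)} L(f\sigma)\,w = \int_Q L^*(1_{E_k(Q)}w)\,f\sigma
= \int_{Q\setminus\Omega_{k+2}} L^*(1_{E_k(Q)}w)\,f\sigma + \int_{Q\cap\Omega_{k+2}} L^*(1_{E_k(Q)}w)\,f\sigma. \tag{8.3}
\]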

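8.1. The part on $Q\setminus\Omega_{k+2}$. The first term is easy to estimate:

\[
\Big|\int_{Q\setminus\Omega_{k+2}} L^*(1_{E_k(Q)}w)\,f\sigma\Big|
\le \|1_Q L^*(1_{E_k(Q)}w)\|_{L^{p'}(\sigma)}\,\|1_{Q\setminus\Omega_{k+2}}f\|_{L^p(\sigma)}
\le \mathfrak{T}_p\,w(Q)^{1/p'}\,\|1_{Q\setminus\Omega_{k+2}}f\|_{L^p(\sigma)},
\]

and then

\[
\sum_k\sum_{Q\in\mathcal{Q}_k} w(E_k(Q))\Big|\frac{1}{w(Q)}\int_{Q\setminus\Omega_{k+2}} L^*(1_{E_k(Q)}w)\,f\sigma\Big|^p
\le \mathfrak{T}_p^p\sum_k\sum_{Q\in\mathcal{Q}_k}\|1_{Q\setminus\Omega_{k+2}}f\|^p_{L^p(\sigma)}
\le \mathfrak{T}_p^p\,\|f\|^p_{L^p(\sigma)},
\]

recalling in the last step that the $k$ sum is over either odd or even $k$ only.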



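8.2. The part on $Q\cap\Omega_{k+2}$. We are left to estimate the integrals over $Q\cap\Omega_{k+2}$ as in the
second term on the right of (8.3). Using $Q\in\mathcal{Q}_k$ and $\Omega_{k+2} = \bigcup_{R\in\mathcal{Q}_{k+2}} R$ as well as the
nestedness of the collections $\mathcal{Q}_k$, we have

\[
\int_{Q\cap\Omega_{k+2}} L^*(1_{E_k(Q)}w)\,f\sigma = \sum_{\substack{R\in\mathcal{Q}_{k+2}\\ R\subset Q}}\int_R L^*(1_{E_k(Q)}w)\,f\sigma.
\]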



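Now $E_k(Q)\subset\Omega_{k+2}^c\subset(R^{(C)})^c$, where $C = \kappa+1$, so $L^*(1_{E_k(Q)}w)$ is in fact constant on $R\in\mathcal{Q}_{k+2}$
(by the 'smoothness property' formulated in Lemma 3.5); thus

\[
\int_R L^*(1_{E_k(Q)}w)\,f\sigma = \int_R L^*(1_{E_k(Q)}w)\,\sigma\cdot\mathbb{E}^\sigma_R f.
\]

Splitting into the cases according to the size of $\mathbb{E}^\sigma_R|f|$ relative to $\mathbb{E}^\sigma_{\Gamma(Q)}|f|$, we can thus estimate

\[
\Big|\sum_{\substack{R\in\mathcal{Q}_{k+2}\\ R\subset Q}}\int_R L^*(1_{E_k(Q)}w)\,f\sigma\Big|
\le \sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|\le16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\cdot\mathbb{E}^\sigma_R|f|
+ \sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\cdot\mathbb{E}^\sigma_R|f|. \tag{8.4}
\]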



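8.3. The part with $\mathbb{E}^\sigma_R|f|\le16\,\mathbb{E}^\sigma_{\Gamma(Q)}|f|$. We estimate the first sum on the right of (8.4).
As a first step, by the disjointness of the $R\in\mathcal{Q}_{k+2}$ we have

\[
\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|\le16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma \le \int_Q|L^*(1_{E_k(Q)}w)|\,\sigma.
\]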



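Next, we make a manipulation involving random signs $\varepsilon_{k',Q'}$ indexed by pairs $k'\in\mathbb{Z}$ and
$Q'\in\mathcal{Q}_{k'}$. At almost every $x$,

\[
|L^*(1_{E_k(Q)}w)(x)| \le \mathbb{E}_\varepsilon\Big|L^*\Big(\sum_{k'}\sum_{Q'\in\mathcal{Q}_{k'}}\varepsilon_{k',Q'}1_{E_{k'}(Q')}1_Q w\Big)(x)\Big|
=: \mathbb{E}_\varepsilon|L^*(\psi_\varepsilon 1_Q w)(x)|,
\]

where, by the disjointness of the sets $E_{k'}(Q')$, we have $\|\psi_\varepsilon\|_\infty\le1$ for any choice of the signs $\varepsilon_{k',Q'}$.

We can then compute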



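\[
\begin{aligned}
&\sum_k\sum_{Q\in\mathcal{Q}_k} w(E_k(Q))\Big(\frac{1}{w(Q)}\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|\le16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\cdot\mathbb{E}^\sigma_R|f|\Big)^p\\
&\quad\lesssim \sum_{G\in\mathcal{G}}(\mathbb{E}^\sigma_G|f|)^p\sum_k\sum_{\substack{Q\in\mathcal{Q}_k\\ \Gamma(Q)=G}} w(E_k(Q))\Big(\frac{1}{w(Q)}\int_Q\mathbb{E}_\varepsilon|L^*(\psi_\varepsilon1_Qw)|\,\sigma\Big)^p
&&\text{(by the preceding considerations)}\\
&\quad\le \sum_{G\in\mathcal{G}}(\mathbb{E}^\sigma_G|f|)^p\,\mathbb{E}_\varepsilon\sum_k\sum_{\substack{Q\in\mathcal{Q}_k\\ \Gamma(Q)=G}} w(E_k(Q))\Big(\frac{1}{w(Q)}\int_Q|L^*(\psi_\varepsilon1_Qw)|\,\sigma\Big)^p
&&\text{(by H\"older's inequality and the linearity of $\mathbb{E}_\varepsilon$)}\\
&\quad\le \sum_{G\in\mathcal{G}}(\mathbb{E}^\sigma_G|f|)^p\,\mathbb{E}_\varepsilon\int_G\Big(\sup_{Q\subset G}\frac{1_Q}{w(Q)}\int_Q|L^*(\psi_\varepsilon1_Qw)|\,\sigma\Big)^p w
&&\text{(by the disjointness of the sets $E_k(Q)$)}\\
&\quad\le \mathfrak{N}_p^p\sum_{G\in\mathcal{G}}(\mathbb{E}^\sigma_G|f|)^p\,\sigma(G)
&&\text{(by the non-standard testing condition)}\\
&\quad\lesssim \mathfrak{N}_p^p\,\|f\|^p_{L^p(\sigma)}
&&\text{(by definition of principal cubes and universal maximal inequality).}
\end{aligned}
\]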

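8.4. The part with $\mathbb{E}^\sigma_R|f|>16\,\mathbb{E}^\sigma_{\Gamma(Q)}|f|$. We estimate the second term on the right of (8.4).
Inserting $\sigma(R)^{\pm1/p}$ into this sum and using Hölder's inequality, we have

\[
\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\cdot\mathbb{E}^\sigma_R|f|
\le \Big(\sum_{\substack{R\in\mathcal{Q}_{k+2}\\ R\subset Q}}\sigma(R)^{-p'/p}\Big(\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\Big)^{p'}\Big)^{1/p'}
\Big(\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\sigma(R)\,(\mathbb{E}^\sigma_R|f|)^p\Big)^{1/p},
\]

where the first factor is further dominated by

\[
\Big(\sum_{\substack{R\in\mathcal{Q}_{k+2}\\ R\subset Q}}\int_R|L^*(1_{E_k(Q)}w)|^{p'}\sigma\Big)^{1/p'} \le \|1_QL^*(1_{E_k(Q)}w)\|_{L^{p'}(\sigma)}.
\]

So altogether, and appealing to the testing condition (4.5),

\[
\begin{aligned}
&\sum_k\sum_{Q\in\mathcal{Q}_k} w(E_k(Q))\Big(\frac{1}{w(Q)}\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\int_R|L^*(1_{E_k(Q)}w)|\,\sigma\cdot\mathbb{E}^\sigma_R|f|\Big)^p\\
&\quad\le \sum_k\sum_{Q\in\mathcal{Q}_k}\frac{w(E_k(Q))}{w(Q)^p}\,\|1_QL^*(1_{E_k(Q)}w)\|^p_{L^{p'}(\sigma)}\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\sigma(R)\,(\mathbb{E}^\sigma_R|f|)^p\\
&\quad\le \mathfrak{T}_p^p\sum_k\sum_{Q\in\mathcal{Q}_k}\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\sigma(R)\,(\mathbb{E}^\sigma_R|f|)^p.
\end{aligned} \tag{8.5}
\]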

The proof is completed by the following lemma: 

Lemma 8.6. Any given cube $R$ appears at most once in the sum on the right of (8.5). For
any two cubes $R_1\subsetneq R_2$ appearing in this sum, we have $\mathbb{E}^\sigma_{R_1}|f| > 4\,\mathbb{E}^\sigma_{R_2}|f|$.

Proof. We can prove both claims with one strike as follows. Let $R_1\subseteq R_2$ be cubes appearing
in (8.5), possibly equal. Thus for some cubes $Q_i$, we have

\[
R_i\in\mathcal{Q}_{k_i+2},\qquad R_i\subset Q_i\in\mathcal{Q}_{k_i},\qquad \mathbb{E}^\sigma_{R_i}|f| > 16\,\mathbb{E}^\sigma_{\Gamma(Q_i)}|f|,\qquad i=1,2.
\]

(In the equal case, we want to prove that $(k_1,Q_1) = (k_2,Q_2)$; in the unequal case, the estimate
between the averages.) Note that if $k_1 = k_2$, then also $R_1 = R_2$ and $Q_1 = Q_2$, so there is
nothing to prove. So let $k_1 > k_2$, thus by the restriction on the $k$-sum, in fact $k_1\ge k_2+2$.
Since the cubes $Q_1\in\mathcal{Q}_{k_1}$ and $R_2\in\mathcal{Q}_{k_2+2}$ intersect (on $R_1$) and $k_1\ge k_2+2$, the nestedness
property implies that $Q_1\subseteq R_2$, and hence $\Gamma(Q_1)\subseteq\Gamma(R_2)$. Thus

\[
\mathbb{E}^\sigma_{R_1}|f| > 16\,\mathbb{E}^\sigma_{\Gamma(Q_1)}|f| \ge 16\,\mathbb{E}^\sigma_{\Gamma(R_2)}|f| \ge 4\,\mathbb{E}^\sigma_{R_2}|f|.
\]

For $R_1 = R_2$, this gives a contradiction, showing that the same $R$ cannot arise in the sum
more than once. And for $R_1\subsetneq R_2$, this is precisely the asserted estimate. □

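From the lemma it follows that

\[
\sum_k\sum_{Q\in\mathcal{Q}_k}\sum_{\substack{R\in\mathcal{Q}_{k+2},\,R\subset Q\\ \mathbb{E}^\sigma_R|f|>16\mathbb{E}^\sigma_{\Gamma(Q)}|f|}}\sigma(R)\,(\mathbb{E}^\sigma_R|f|)^p \lesssim \|f\|^p_{L^p(\sigma)},
\]

and the proof is complete.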
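
One way to see how Lemma 8.6 yields the final bound is the following geometric-series sketch. It is a reconstruction under stated assumptions, not necessarily the authors' exact argument: $M_\sigma f(x) := \sup_{R\ni x}\mathbb{E}^\sigma_R|f|$ denotes the dyadic maximal function with respect to $\sigma$ (a symbol introduced here), and $\mathcal{R}$ denotes the family of distinct cubes $R$ appearing in (8.5).

```latex
Fix $x$ and let $R_1 \subsetneq R_2 \subsetneq \dots \subsetneq R_m$ be any finite chain of
cubes in $\mathcal{R}$ containing $x$. By Lemma 8.6,
\[
  \mathbb{E}^\sigma_{R_j}|f| < 4^{-(j-1)}\,\mathbb{E}^\sigma_{R_1}|f| \le 4^{-(j-1)} M_\sigma f(x),
  \qquad j = 1,\dots,m,
\]
so that, since $4^{-p} \le \tfrac14$ for $p \ge 1$,
\[
  \sum_{j=1}^{m} \bigl(\mathbb{E}^\sigma_{R_j}|f|\bigr)^p
  \le M_\sigma f(x)^p \sum_{j \ge 1} 4^{-(j-1)p}
  \le \tfrac43\, M_\sigma f(x)^p .
\]
Taking the supremum over finite chains and integrating against $\sigma$,
\[
  \sum_{R \in \mathcal{R}} \sigma(R)\,\bigl(\mathbb{E}^\sigma_R|f|\bigr)^p
  = \int \sum_{R \in \mathcal{R}} 1_R(x)\,\bigl(\mathbb{E}^\sigma_R|f|\bigr)^p \,\sigma(dx)
  \le \tfrac43\,\|M_\sigma f\|_{L^p(\sigma)}^p
  \lesssim_p \|f\|_{L^p(\sigma)}^p .
\]
```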

9. Unweighted weak-type (1,1) inequalities

We are now finished with the two-weight theory, and we start anew from a different corner 
of our proof scheme, see Figure 1, the weak-type (1,1) estimates for Haar shift operators. 
Again, we need bounds that are effective in the complexity. The following estimate was proved 
in [7, Proposition 5.1], with the additional observation concerning shifts with separated scales 
made in [11, Theorem 5.2]. 

Definition 9.1. We say that a shift $\mathbb{S}$ of complexity $\kappa$ has its scales separated if all nonzero
components $\mathbb{S}_Q$, $\mathbb{S}_{Q'}$ have $\log_2\ell(Q)\equiv\log_2\ell(Q') \bmod (\kappa+1)$. We likewise say that a subset $\mathcal{Q}$
of the dyadic grid has scales separated if $\log_2\ell(Q)\equiv\log_2\ell(Q') \bmod (\kappa+1)$ for all $Q,Q'\in\mathcal{Q}$.

Proposition 9.2. An $L^2$-bounded Haar shift operator $\mathbb{S}$ of complexity $\kappa$ maps $L^1(\mathbb{R}^d)$ into
$L^{1,\infty}(\mathbb{R}^d)$ with norm at most $C_d\kappa$. If $\mathbb{S}$ has scales separated, then the norm is at most $C_d$.

We need a strengthening of this Proposition for the duals $L^*$ of all the linearizations $L$ of the
maximal truncation $\mathbb{S}_\natural$ of $\mathbb{S}$. This type of result is not a classical one, and to prove it, we
use a simplified version of the argument of [16] used to prove Carleson's Theorem on Fourier
series [3].

Theorem 9.3. For an $L^2$-bounded Haar shift operator $\mathbb{S}$ of complexity $\kappa$, we have the following
estimate, uniform in $\lambda>0$, and compactly supported and bounded functions $f$ on $\mathbb{R}^d$:

\[ \lambda\,|\{L^*f>\lambda\}| \lesssim \kappa\,\|f\|_1, \tag{9.4} \]

where the inequality holds uniformly in the choice of the linearization $L$ of $\mathbb{S}_\natural$. If $\mathbb{S}$ has its scales
separated, we have the complexity-free bound

\[ \lambda\,|\{L^*f>\lambda\}| \lesssim \|f\|_1. \tag{9.5} \]

9.1. Case $\int\mathbb{S}_Qf = 0$. We begin proving the weak type (1,1) inequality with the additional
hypothesis that the components of our shift $\mathbb{S}$ all satisfy $\int\mathbb{S}_Qf = 0$. Note that this covers
all cases relevant to us, except for the dual paraproduct case $\mathbb{S}_Qf = |Q|^{-1}\langle f,h_Q\rangle1_Q$, which
will be taken up in the next subsection.

We start off with the Tree lemma, with this terminology and notation adapted from [16]. 

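Lemma 9.6 (Tree lemma). Suppose that $\mathcal{Q}$ is a collection of dyadic cubes, all contained in
some $Q_0$. Then

\[ L_{\mathcal{Q}}f(x) := \sum_{\substack{Q\in\mathcal{Q}\\ \ell(Q)\le v(x)}}\mathbb{S}_Qf(x) \]

satisfies

\[ |\langle L_{\mathcal{Q}}f,g\rangle| \lesssim \kappa\cdot\operatorname{size}_f(\mathcal{Q})\cdot\operatorname{dense}_g(\mathcal{Q})\cdot|Q_0|, \tag{9.7} \]

where

\[
\operatorname{size}_f(\mathcal{Q}) := \sup_{Q\in\mathcal{Q}}\Big(\frac{1}{|Q|}\int_Q|f|^2\Big)^{1/2},\qquad
\operatorname{dense}_g(\mathcal{Q}) := \sup_{Q\in\mathcal{Q}}\,\sup_{Q'\supseteq Q}\frac{1}{|Q'|}\int_{Q'}|g|.
\]

The factor $\kappa$ may be omitted in (9.7) if the scales of $\mathbb{S}$ are separated.

Remark 9.8. The notions of 'size' and 'density' are derived from [16]. But size in this context
is simpler, related to the maximal function $(M|f|^2)^{1/2}$. The double supremum in the definition
of density is essential for the inequality (9.9) below.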

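Proof. We may assume that $Q_0\in\mathcal{Q}$. For if not, let $Q_i$ be the maximal cubes in $\mathcal{Q}$, all
contained in $Q_0$, and $\mathcal{Q}_i := \{Q\in\mathcal{Q} : Q\subseteq Q_i\}$. Clearly the size and density of each $\mathcal{Q}_i$ is
dominated by the corresponding number for $\mathcal{Q}$. Then we just write $L_{\mathcal{Q}} = \sum_i L_{\mathcal{Q}_i}$, use the
estimate for each $\mathcal{Q}_i$, and sum up $\sum_i|Q_i|\le|Q_0|$ in the end.

Also assume by approximation that the collection $\mathcal{Q}$ is finite. Let $\mathcal{P}$ consist of the minimal
dyadic cubes $P\subseteq Q_0$ such that $P^{(1)}$ contains some element of $\mathcal{Q}$. The cubes in $\mathcal{P}$ then form
a partition of $\bigcup\{Q : Q\in\mathcal{Q}\}$. Define the maximal operator $\mathcal{M}_{\mathcal{P}}$ by

\[ \mathcal{M}_{\mathcal{P}}\phi(x) := \sum_{P\in\mathcal{P}}|g(x)|\,1_{P\cap\{\ell(P^{(1)})\le v(x)\}}\,\sup_{Q\supseteq P}\mathbb{E}_Q|\phi|. \]

Note that if $P\in\mathcal{P}$, then $P^{(1)}\supseteq Q$ for some cube $Q\in\mathcal{Q}$, whence

\[ \mathbb{E}_P\big[|g|\,1_{\{\ell(P^{(1)})\le v(\cdot)\}}\big] \le 2^d\,\mathbb{E}_{P^{(1)}}\big[|g|\,1_{\{\ell(P^{(1)})\le v(\cdot)\}}\big] \le 2^d\operatorname{dense}_g(\mathcal{Q}). \]

From this we conclude a particular restricted strong-type inequality for $\mathcal{M}_{\mathcal{P}}$:

\[
\begin{aligned}
\int\mathcal{M}_{\mathcal{P}}\phi &\le \sum_{P\in\mathcal{P}}|P|\cdot\mathbb{E}_P\big[|g|\,1_{\{\ell(P^{(1)})\le v(\cdot)\}}\big]\sup_{Q\supseteq P}\mathbb{E}_Q|\phi|\\
&\le 2^d\operatorname{dense}_g(\mathcal{Q})\sum_{P\in\mathcal{P}}|P|\sup_{Q\supseteq P}\mathbb{E}_Q|\phi|\\
&\le 2^d\operatorname{dense}_g(\mathcal{Q})\int_{Q_0}M\phi\\
&\le 2^d\operatorname{dense}_g(\mathcal{Q})\,|Q_0|^{1/2}\|M\phi\|_2 \lesssim \operatorname{dense}_g(\mathcal{Q})\,|Q_0|^{1/2}\|\phi\|_2.
\end{aligned} \tag{9.9}
\]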
Now we start to estimate the expression in (9.7). For x E P eV, we have 

QeQ, Q2P QeQ, Q^P 

e(Q)<v{x) e{Q)<v{x) 

since the summation is empty unless $\ell(P) < v(x)$. Let $P_{v(x)}\supsetneq P$ be the unique dyadic cube
with $\ell(P_{v(x)}) = v(x)$. Then

QeQ, Q2P QeQ QeQ 

^(Q)<,;(x) QDP Q^P^(^) 

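For any dyadic $R$ and $x\in R$, we have

\[
\sum_{\substack{Q\in\mathcal{Q}\\ Q\supsetneq R}}\mathbb{S}_Qf(x) = \mathbb{E}_R\Big(\sum_{Q\in\mathcal{Q}}\mathbb{S}_Qf\Big) + \sum_{\substack{Q\in\mathcal{Q}\\ R\subsetneq Q\subseteq R^{(\kappa)}}}\big(\mathbb{S}_Qf(x) - \mathbb{E}_R(\mathbb{S}_Qf)\big), \tag{9.10}
\]

which follows from the facts that $\mathbb{E}_R(\mathbb{S}_Qf) = 0$ for $Q\subseteq R$, and $\mathbb{S}_Qf$ is constant on $R$ for
$Q\supsetneq R^{(\kappa)}$. And here

\[
\sum_{R\in\{P,\,P_{v(x)}\}}\Big|\mathbb{E}_R\Big(\sum_{Q\in\mathcal{Q}}\mathbb{S}_Qf\Big)\Big| \le 2\sup_{R\supseteq P}\mathbb{E}_R|\mathbb{S}_{\mathcal{Q}}f|,\qquad \mathbb{S}_{\mathcal{Q}}f := \sum_{Q\in\mathcal{Q}}\mathbb{S}_Qf.
\]

For the second sum in (9.10), observe that we have

\[
|\mathbb{E}_R(\mathbb{S}_Qf)| \le \|\mathbb{S}_Qf\|_\infty \le \frac{1}{|Q|}\int_Q|f| \le \Big(\frac{1}{|Q|}\int_Q|f|^2\Big)^{1/2} \le \operatorname{size}_f(\mathcal{Q})
\]

for each term, and there are at most $4\kappa$ terms altogether for both $R\in\{P,P_{v(x)}\}$. If the
scales of $\mathbb{S}$ are separated, then there is at most one nonzero $\mathbb{S}_Q$ with $R\subsetneq Q\subseteq R^{(\kappa)}$, so there
are at most 4 nonzero terms with the mentioned estimate, instead of $4\kappa$. This is the only
place where the factor $\kappa$ enters into the argument, and the rest of the proof can be modified
to the case of separated scales by simply substituting 1 in place of $\kappa$.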
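Substituting back, we have shown that

\[
|L_{\mathcal{Q}}f(x)| \le 1_{\{\ell(P^{(1)})\le v(x)\}}\Big(2\sup_{R\supseteq P}\mathbb{E}_R|\mathbb{S}_{\mathcal{Q}}f| + 4\kappa\operatorname{size}_f(\mathcal{Q})\Big)1_{Q_0},\qquad\forall x\in P\in\mathcal{P},
\]

and hence

\[
|L_{\mathcal{Q}}f(x)\cdot g(x)| \le |g(x)|\,1_{P\cap\{\ell(P^{(1)})\le v(x)\}}\sup_{R\supseteq P}\mathbb{E}_R\big(2|\mathbb{S}_{\mathcal{Q}}f| + 4\kappa\operatorname{size}_f(\mathcal{Q})1_{Q_0}\big)
= \mathcal{M}_{\mathcal{P}}\phi(x),\qquad \phi := 2|\mathbb{S}_{\mathcal{Q}}f| + 4\kappa\operatorname{size}_f(\mathcal{Q})1_{Q_0}.
\]

An application of (9.9) then gives

\[ |\langle L_{\mathcal{Q}}f,g\rangle| \lesssim \operatorname{dense}_g(\mathcal{Q})\cdot|Q_0|^{1/2}\cdot\|\phi\|_2, \]

and

\[
\|\phi\|_2 \lesssim \|\mathbb{S}_{\mathcal{Q}}(1_{Q_0}f)\|_2 + \kappa\operatorname{size}_f(\mathcal{Q})\|1_{Q_0}\|_2
\lesssim \|1_{Q_0}f\|_2 + \kappa\operatorname{size}_f(\mathcal{Q})|Q_0|^{1/2} \le (1+\kappa)\operatorname{size}_f(\mathcal{Q})|Q_0|^{1/2}.
\]

This completes the proof, recalling that in the case of separated scales we can take 1 in place
of $\kappa$ above. □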

The next two Lemmas present decompositions of collections Q relative to the two quantities 
of density and size. 

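Lemma 9.11. Let $\mathcal{Q}$ be an arbitrary collection of dyadic cubes, $g\in L^1$, and $\delta>0$. Then
$\mathcal{Q} = \mathcal{Q}'\cup\bigcup_j\mathcal{Q}_j$ where $\operatorname{dense}_g(\mathcal{Q}')\le\delta\|g\|_1$ and $\mathcal{Q}_j = \{Q\in\mathcal{Q} : Q\subseteq Q_j\}$, where

\[ \sum_j|Q_j| \le \delta^{-1}. \]

Proof. Let $\mathcal{Q}' = \{Q\in\mathcal{Q} : \sup_{Q'\supseteq Q}|Q'|^{-1}\int_{Q'}|g|\le\delta\|g\|_1\}$. Then, by definition, there holds
$\operatorname{dense}_g(\mathcal{Q}')\le\delta\|g\|_1$. Let $Q_j$ be the maximal elements of $\mathcal{Q}\setminus\mathcal{Q}'$, and $\mathcal{Q}_j$ be defined as in the
statement. If $Q\in\mathcal{Q}\setminus\mathcal{Q}'$, then $|R(Q)|^{-1}\int_{R(Q)}|g| > \delta\|g\|_1$ for some dyadic $R(Q)\supseteq Q$. Let $\mathcal{A}$
denote the maximal elements of $\{R(Q) : Q\in\mathcal{Q}\setminus\mathcal{Q}'\}$. We have

\[
\sum_j|Q_j| \le \sum_{R\in\mathcal{A}}\sum_{j:\,Q_j\subseteq R}|Q_j| \le \sum_{R\in\mathcal{A}}|R| \le \sum_{R\in\mathcal{A}}\frac{1}{\delta\|g\|_1}\int_R|g| \le \delta^{-1},
\]

where we used that $Q_j\subseteq R(Q_j)$, the disjointness of the cubes $Q_j$ and the disjointness of the
cubes of $\mathcal{A}$. □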

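Lemma 9.12. Let $\mathcal{Q}$ be an arbitrary collection of dyadic cubes, $f\in L^2$, and $\sigma>0$. Then
$\mathcal{Q} = \mathcal{Q}'\cup\bigcup_j\mathcal{Q}_j$ where $\operatorname{size}_f(\mathcal{Q}')\le\sigma\|f\|_2$ and $\mathcal{Q}_j = \{Q\in\mathcal{Q} : Q\subseteq Q_j\}$, where

\[ \sum_j|Q_j| \le \sigma^{-2}. \]

Proof. Let $Q_j$ be the maximal elements of $\mathcal{Q}$ with $(|Q_j|^{-1}\int_{Q_j}|f|^2)^{1/2} > \sigma\|f\|_2$, if any, and
$\mathcal{Q}_j$ be defined as in the statement. Then clearly $\mathcal{Q}' := \mathcal{Q}\setminus\bigcup_j\mathcal{Q}_j$ satisfies $\operatorname{size}_f(\mathcal{Q}')\le\sigma\|f\|_2$,
and

\[
\sum_j|Q_j| \le \sum_j\frac{1}{\sigma^2\|f\|_2^2}\int_{Q_j}|f|^2 \le \frac{1}{\sigma^2},
\]

since the $Q_j$ are disjoint by maximality. □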

Inductive application of the previous two Lemmas leads to the following general result on 
decomposition of an arbitrary collection of cubes Q. 

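Lemma 9.13. Let $\mathcal{Q}$ be an arbitrary collection of dyadic cubes. Let $n_0$ be such that

\[ \operatorname{dense}_g(\mathcal{Q})\le2^{2n_0}\|g\|_1,\qquad \operatorname{size}_f(\mathcal{Q})\le2^{n_0}\|f\|_2. \]

We can write a disjoint union

\[ \mathcal{Q} = \bigcup_{k=-\infty}^{n_0}\bigcup_j\mathcal{Q}^k_j\ \cup\ \mathcal{Q}^{-\infty}, \]

where

(i) $\operatorname{dense}_g(\mathcal{Q}^k_j)\le2^{2k}\|g\|_1$,

(ii) $\operatorname{size}_f(\mathcal{Q}^k_j)\le2^{k}\|f\|_2$,

(iii) all cubes in $\mathcal{Q}^k_j$ are contained in one $Q^k_j$, and

\[ \sum_j|Q^k_j| \le 8\cdot2^{-2k}, \]

(iv) both $g$ and $f$ vanish almost everywhere on all $Q\in\mathcal{Q}^{-\infty}$.

Proof. Using the previous Lemmas, we first split $\mathcal{Q} = \mathcal{Q}'\cup\bigcup_j\mathcal{Q}'_j$ where $\operatorname{dense}_g(\mathcal{Q}')\le2^{2(n_0-1)}\|g\|_1$,
and the cubes of $\mathcal{Q}'_j$ are contained in $Q'_j$ with $\sum_j|Q'_j|\le2^{-2(n_0-1)}$. Next,
$\mathcal{Q}' = \mathcal{Q}''\cup\bigcup_j\mathcal{Q}''_j$ where $\operatorname{size}_f(\mathcal{Q}'')\le2^{n_0-1}\|f\|_2$, and the cubes of $\mathcal{Q}''_j$ are contained in $Q''_j$ with
$\sum_j|Q''_j|\le2^{-2(n_0-1)}$. We re-enumerate the collections $\mathcal{Q}'_j$ and $\mathcal{Q}''_j$ as $\mathcal{Q}^{n_0}_j$, and similarly for the
containing cubes $Q^{n_0}_j$, which satisfy $\sum_j|Q^{n_0}_j|\le8\cdot2^{-2n_0}$. Since $\mathcal{Q}^{n_0}_j\subseteq\mathcal{Q}$, these have density and
size as required, and $\mathcal{Q}''\subseteq\mathcal{Q}'$ has $\operatorname{dense}_g(\mathcal{Q}'')\le2^{2(n_0-1)}\|g\|_1$ and $\operatorname{size}_f(\mathcal{Q}'')\le2^{n_0-1}\|f\|_2$ by construction.
We may thus iterate with $(\mathcal{Q},n_0)$ replaced by $(\mathcal{Q}'',n_0-1)$. If some cube $Q\in\mathcal{Q}$ is not
chosen to any $\mathcal{Q}^k_j$ at any phase of the iteration, this means that $\int_Q|g| = \int_Q|f|^2 = 0$;
these cubes constitute the collection $\mathcal{Q}^{-\infty}$. □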

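Now we are prepared to prove the weak type (1,1) inequality for $L^*$.

Proof of (9.4). We may assume that $\lambda = 1$ and that $|E|<\infty$, where $E := \{|L^*g|>1\}$.
Consider the set $G = E\cap\{Mg\le C_d\|g\|_1/|E|\}$. Fixing $C_d$ large enough (depending on the
dimension $d$ only), there holds

\[ \tfrac12|E| \le |G| \le \int_G|L^*g| = \langle f,L^*g\rangle = \langle L_{\mathcal{Q}}f,g\rangle, \]

where $|f| = 1_G$, and $\mathcal{Q} = \{Q\in\mathcal{D} : Q\cap G\ne\emptyset\}$; note that $\mathbb{S}_Qf = 0$ unless $Q\in\mathcal{Q}$.
The point of passing to the supplementary set $G$ is that we have the estimate

\[ \operatorname{dense}_g(\mathcal{Q}) \lesssim |E|^{-1}\|g\|_1. \]

And, as $|f| = 1_G$, it also follows that $\operatorname{size}_f(\mathcal{Q})\le1 = |G|^{-1/2}\|f\|_2$. We apply Lemma 9.13 to
$\mathcal{Q}$, which yields the corresponding decomposition of $L_{\mathcal{Q}}$. Note that $\langle L_{\mathcal{Q}^{-\infty}}f,g\rangle = 0$. Hence

\[
\begin{aligned}
|\{|L^*g|>1\}| = |E| &\lesssim |\langle L_{\mathcal{Q}}f,g\rangle| \le \sum_{k,j}|\langle L_{\mathcal{Q}^k_j}f,g\rangle|\\
&\lesssim \kappa\sum_{k,j}\operatorname{size}_f(\mathcal{Q}^k_j)\cdot\operatorname{dense}_g(\mathcal{Q}^k_j)\cdot|Q^k_j|\\
&\lesssim \kappa\sum_k2^k\|f\|_2\cdot\min\{2^{2k},|E|^{-1}\}\|g\|_1\sum_j|Q^k_j|\\
&\lesssim \kappa\sum_k2^k|E|^{1/2}\cdot\min\{2^{2k},|E|^{-1}\}\|g\|_1\cdot2^{-2k} \lesssim \kappa\|g\|_1.
\end{aligned}
\]

If the scales of $\mathbb{S}$ are separated, the factor $\kappa$ does not appear in the application of the Tree
Lemma 9.6, and we obtain instead that $|\{|L^*g|>1\}|\lesssim\|g\|_1$, as claimed. This completes
the proof. □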
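
The last sum over $k$ in the proof above converges because the minimum switches regime at $2^{2k} = |E|^{-1}$. The following sketch makes this explicit; it assumes the summand has the form $2^k|E|^{1/2}\min\{2^{2k},|E|^{-1}\}\,2^{-2k}$ (the factor $\|g\|_1$ is pulled out), and the constant 4 is only what this computation gives, not a constant claimed in the source.

```latex
\begin{align*}
\sum_{k:\,2^{2k}\le|E|^{-1}} 2^{k}|E|^{1/2}\cdot 2^{2k}\cdot 2^{-2k}
  &= |E|^{1/2}\sum_{k:\,2^{k}\le|E|^{-1/2}} 2^{k}
   \le |E|^{1/2}\cdot 2\,|E|^{-1/2} = 2,\\
\sum_{k:\,2^{2k}>|E|^{-1}} 2^{k}|E|^{1/2}\cdot |E|^{-1}\cdot 2^{-2k}
  &= |E|^{-1/2}\sum_{k:\,2^{-k}<|E|^{1/2}} 2^{-k}
   \le |E|^{-1/2}\cdot 2\,|E|^{1/2} = 2,\\
\text{so that}\qquad
\sum_{k\in\mathbb{Z}} 2^{k}|E|^{1/2}\min\{2^{2k},|E|^{-1}\}\,2^{-2k} &\le 4 .
\end{align*}
```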

9.2. The case $\int\mathbb{S}_Qf\ne0$. As mentioned, this only appears in the dual paraproduct case
where $\mathbb{S}_Qf = |Q|^{-1}\langle f,h_Q\rangle1_Q$, where $h_Q$ is a Haar function on $Q$. In this case, the operator
$L^*$ has the form

\[
L^*g(x) = \sum_Q|Q|^{-1}h_Q(x)\langle1_{Q\cap\{\ell(Q)\le v(\cdot)\}},g\rangle = \sum_Q|Q|^{-1}h_Q(x)\,a_Q\langle1_Q,|g|\rangle =: \tilde{\mathbb{S}}|g|(x),
\]

where $|a_Q|\le1$ are some numbers, of course depending on $g$. However, the new shifts
$\tilde{\mathbb{S}}f := \sum_{Q\in\mathcal{D}}a_Q|Q|^{-1}h_Q\langle1_Q,f\rangle$ are uniformly bounded on $L^2$ with respect to the choice
of the $a_Q$, hence by the weak-type estimate for untruncated shifts, also uniformly bounded
from $L^1$ to $L^{1,\infty}$. Thus

\[ \|L^*g\|_{L^{1,\infty}} = \|\tilde{\mathbb{S}}|g|\|_{L^{1,\infty}} \lesssim \|g\|_{L^1}, \]

and we are done in this case as well.

10. Distributional Estimates 

10.1. A John-Nirenberg Estimate. We recall this formulation from [11, Lemma 5.5]. Let
$\mathcal{D}_\kappa$ be a scales separated grid, as defined in Definition 9.1.

Definition 10.1. Let $\{\phi_Q : Q\in\mathcal{D}_\kappa\}$ be a collection of functions such that $\phi_Q$ is supported
on $Q$ and is constant on its $\mathcal{D}_\kappa$-children. For $R_0\in\mathcal{D}_\kappa$ let $\phi^*_{R_0}$ be the maximal function

\[ \phi^*_{R_0}(x) := \sup_{Q\ni x}\Big|\sum_{Q\subsetneq R\subseteq R_0}\phi_R(x)\Big|. \]

Lemma 10.2. Let $\{\phi_Q : Q\in\mathcal{D}_\kappa\}$ be a collection of functions such that

(1) $\phi_Q$ is supported on $Q$ and constant on the $\mathcal{D}_\kappa$-children of $Q$;

(2) $\|\phi_Q\|_\infty\le1$;

(3) There exists $\delta\in(0,1)$ such that for all cubes $R\in\mathcal{D}_\kappa$

\[ |\{x\in R : \phi^*_R(x)>1\}| \le \delta|R|. \]

Then for all $R\in\mathcal{D}_\kappa$ and for all $t>1$

\[ |\{x\in R : \phi^*_R(x)>t\}| \le \delta^{(t-1)/2}|R|. \]

10.2. The Corona And Distributional Estimates. We need the important definition of
the stopping cubes, and Corona Decomposition. The grid $\mathcal{D}_\kappa\subset\mathcal{D}$ has scales separated, as
in Definition 9.1.

Definition 10.3. Let $w$ be a weight and $Q\in\mathcal{D}_\kappa$. We set the $w$-stopping children $\mathcal{S}(Q)$ to
be the maximal subcubes $Q'\subset Q$ with $Q'\in\mathcal{D}_\kappa$ so that

\[ \frac{w(Q')}{|Q'|} > 4\,\frac{w(Q)}{|Q|}. \tag{10.4} \]

Setting $\mathcal{S}_0 = \mathcal{S}(Q)$, and inductively setting $\mathcal{S}_{j+1} := \bigcup_{Q'\in\mathcal{S}_j}\mathcal{S}(Q')$, we refer to $\mathcal{S} := \bigcup_{j=0}^\infty\mathcal{S}_j$
as the $w$-stopping cubes of $Q$.

The $w$-Corona Decomposition of a collection of cubes $\mathcal{Q}$ with $Q'\subset Q$ for all $Q'\in\mathcal{Q}$,
consists of the $w$-stopping cubes $\mathcal{S}$, and collections of cubes $\{\mathcal{P}(S) : S\in\mathcal{S}\}$ so that (1) the
collections $\mathcal{P}(S)$ form a disjoint decomposition of $\mathcal{Q}$, and (2) for all $S\in\mathcal{S}$, and $Q'\in\mathcal{P}(S)$,
we have that $S$ is the minimal element of $\mathcal{S}$ that contains $Q'$. In particular, we have

\[ \frac{w(Q')}{|Q'|} \le 4\,\frac{w(S)}{|S|},\qquad Q'\in\mathcal{P}(S),\ S\in\mathcal{S}. \]



The previous definitions make sense for any weight $w$. Specializing to $w\in A_p$ leads to the
following elementary, and essential, observations. The first is a familiar inequality, showing
that an $A_p$ weight cannot be too concentrated.

Lemma 10.5. Let $w\in A_p$, $Q$ a cube and $E\subset Q$. We then have

\[ \Big(\frac{|E|}{|Q|}\Big)^p\|w\|_{A_p}^{-1} \le \frac{w(E)}{w(Q)}. \tag{10.6} \]

Proof. The property that $w>0$ a.e. allows us to write

\[
\frac{|E|}{|Q|} = \frac{1}{|Q|}\int_E w(x)^{1/p}w(x)^{-1/p}\,dx
\le \frac{w(E)^{1/p}\sigma(Q)^{1/p'}}{|Q|}
= \Big[\frac{w(E)}{w(Q)}\Big]^{1/p}\frac{w(Q)^{1/p}\sigma(Q)^{1/p'}}{|Q|}
\le \Big[\frac{w(E)}{w(Q)}\Big]^{1/p}\|w\|_{A_p}^{1/p},
\]

which proves the Lemma. □

The second is a direct application of the previous assertion.

Lemma 10.7. Let $w\in A_p$, $Q$ a cube, and $\mathcal{S}$ the $w$-stopping cubes for $Q$. Then, we have

\[ \sum_{S\in\mathcal{S}}w(S) \lesssim \|w\|_{A_p}\,w(Q). \tag{10.8} \]

Proof. It follows from (10.4) that the union of the stopping children $\mathcal{S}(Q)$ has
Lebesgue measure at most $\tfrac14|Q|$. Applying (10.6) to the set $E = Q\setminus\bigcup_{S\in\mathcal{S}(Q)}S$, it follows
that

\[ \frac{w(E)}{w(Q)} \ge (3/4)^p\|w\|_{A_p}^{-1}. \]

Thus, we have

\[ \sum_{S\in\mathcal{S}(Q)}w(S) \le \big(1 - c\|w\|_{A_p}^{-1}\big)\,w(Q). \]

Recalling the inductive definition of the $w$-stopping children, we see that our estimate holds.

□
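
The final step of this proof is a geometric series over the generations of stopping cubes. The sketch below assumes, as a reconstruction, that the estimate (10.8) has the form $\sum_{S\in\mathcal{S}}w(S)\lesssim\|w\|_{A_p}w(Q)$, that $\mathcal{S}_j$ denotes the $j$-th generation of stopping cubes, and that $c = (3/4)^p$; the abbreviation $\theta$ is introduced here for convenience.

```latex
Write $\theta := c\,\|w\|_{A_p}^{-1} \in (0,1]$. Applying the one-generation bound to
each $S' \in \mathcal{S}_j$ in place of $Q$ gives
\[
  \sum_{S \in \mathcal{S}_{j+1}} w(S)
  = \sum_{S' \in \mathcal{S}_j}\ \sum_{S \in \mathcal{S}(S')} w(S)
  \le (1-\theta) \sum_{S' \in \mathcal{S}_j} w(S') ,
\]
so by induction $\sum_{S \in \mathcal{S}_j} w(S) \le (1-\theta)^{j}\, w(Q)$ for all $j \ge 0$, and therefore
\[
  \sum_{S \in \mathcal{S}} w(S)
  \le \sum_{j=0}^{\infty} (1-\theta)^{j}\, w(Q)
  = \theta^{-1}\, w(Q)
  = c^{-1}\,\|w\|_{A_p}\, w(Q) .
\]
```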

10.3. The Distributional Estimates. We combine the John-Nirenberg and Corona Decomposition
to obtain the crucial distributional estimates: The operator $L$, decomposed
using the Corona, satisfies exponential distributional inequalities. We will then strongly use
the exponential moments to control certain norms in the next section. Our estimates
are twofold. The first inequalities involve the Lebesgue measure, and are the important
intermediate step to obtain the second inequalities for the $\sigma$ measure.

Definition 10.9. For $w\in A_p$, and integers $\alpha,\kappa\in\mathbb{N}$, we will say that a collection $\mathcal{Q}$
of cubes is $(w,\alpha,\kappa)$-adapted if these conditions hold. First, for any $Q,Q'\in\mathcal{Q}$, we have
$\log_2\ell(Q)\equiv\log_2\ell(Q') \bmod (\kappa+1)$. Second, we have

\[ 2^{\alpha} \le \frac{w(Q)}{|Q|}\Big[\frac{\sigma(Q)}{|Q|}\Big]^{p-1} < 2^{\alpha+1}. \tag{10.10} \]

We only need to consider $0\le\alpha\le\lceil\log_2\|w\|_{A_p}\rceil$.

Lemma 10.11. Let $w\in A_p$, with dual measure $\sigma$, and let $Q$ be a cube. For integers $\alpha,\kappa$,
let $\mathcal{Q}$ be a collection of cubes contained in $Q$ which are $(\sigma,\alpha,\kappa)$-adapted. Construct the $w$-Corona
Decomposition $\mathcal{P}(S)$, $S\in\mathcal{S}$, of $\mathcal{Q}$. Let $\mathbb{S}$ be a Haar shift operator of complexity $\kappa$,
with $\|\mathbb{S}\|_{L^2\to L^2} = 1$. We have these estimates, uniform in $t\ge1$, choices of linearizations $L$,
functions $\psi$ with $\|\psi\|_\infty\le1$, and $S\in\mathcal{S}$:

\[ \Big|\Big\{x\in S : |L^*_{\mathcal{P}(S)}(1_S\psi w)| > Kt\,\frac{w(S)}{|S|}\Big\}\Big| \lesssim 2^{-t}|S|, \tag{10.12} \]

\[ \sigma\Big(\Big\{x\in S : |L^*_{\mathcal{P}(S)}(1_S\psi w)| > Kt\,\frac{w(S)}{|S|}\Big\}\Big) \lesssim 2^{-t}\sigma(S), \tag{10.13} \]

where $K$ is a dimensional constant.

Observe that the condition that $\mathcal{Q}$ be $(\sigma,\alpha,\kappa)$-adapted implies in particular that all $L_{\mathcal{Q}'}$,
with $\mathcal{Q}'\subset\mathcal{Q}$, can be viewed as linearizations of shifts with separated scales; this will place
the stronger conclusion (9.5) of Theorem 9.3 at our disposal, allowing us to get the stated
complexity-free estimates (10.12) and (10.13). The proof given below follows almost to the
letter the proof of [11, Lemma 5.6].

10.4. Proof of the distributional estimate for the Lebesgue measure. We begin with
the bound (10.12). We aim to apply the John-Nirenberg estimate, Lemma 10.2. To this end,
write

r^^^,c^ Jor]\e(x)<\Q\<v(x)\ 



Q&r{S) 



and 



jji[x) := sup 

Q&V^ : Q3X 



y[x] 



Q'eV{S): QCQ'CR 



for $R\in\mathcal{D}_\kappa$ (note that $\phi^*_R = 0$ if $R\cap S = \emptyset$). Then $|L^*_{\mathcal{P}(S)}(1_S\psi w)|\le\phi^*_S$, so that it suffices to
prove (10.12) for $\phi^*_S$ instead of $|L^*_{\mathcal{P}(S)}(1_S\psi w)|$. The condition (1) of Lemma 10.2 is clear: the
function $\phi_Q$ is supported on $Q$, since the kernel $s_Q$ is supported on $Q\times Q$. Also, $\phi_Q$ is constant
on $\mathcal{D}_\kappa$-children of $Q$. The condition (2), that $\|\phi_Q\|_\infty\le1$, need not hold for cubes $Q\in\mathcal{P}(S)$,
so we have to split these cubes into countably many subfamilies. For $\beta\in\mathbb{Z}_+ = \{0,1,2,\dots\}$,
let $\mathcal{P}_\beta(S)$ consist of all cubes $Q\in\mathcal{P}(S)$ such that

\[ 4^{-\beta}\frac{w(S)}{|S|} < \frac{w(Q)}{|Q|} \le 4^{-\beta+1}\frac{w(S)}{|S|}. \]

For every $Q\in\mathcal{P}(S)$ it holds that

\[ \frac{w(Q)}{|Q|} \le 4\,\frac{w(S)}{|S|}, \]

which means that every $Q\in\mathcal{P}(S)$ is contained in $\mathcal{P}_\beta(S)$ for some $\beta\in\mathbb{Z}_+$. Defining $\phi^*_{R,\beta}$ in
the same manner as $\phi^*_R$, only replacing $\mathcal{P}(S)$ by $\mathcal{P}_\beta(S)$ in the definition, we also have

\[ \phi^*_R \le \sum_{\beta=0}^\infty\phi^*_{R,\beta}. \]

This will allow us to prove (10.12) for the functions $\phi^*_{S,\beta}$ separately. If $Q\in\mathcal{P}_\beta(S)$, we have

\[ |\phi_Q(y)| \le \frac{1}{|Q|}\int_{Q\cap\{\varepsilon(x)<\ell(Q)\le v(x)\}}w(x)\,dx \le 4^{-\beta+1}\frac{w(S)}{|S|}, \]

by definition of $\mathcal{P}_\beta(S)$. Thus condition (2) of Lemma 10.2 holds for the normalized functions
$4^{\beta-1}|S|w(S)^{-1}\phi_Q$, $Q\in\mathcal{P}_\beta(S)$. In order to verify condition (3) for $\phi^*_{R,\beta}$, we need the weak-type
(1,1) inequality for $L^*$ obtained above. To use this inequality, we first have to show that
the set $\{x\in R : \phi^*_{R,\beta}(x)>\lambda\}$ is a subset of

\[ \{x\in R : |L^*_{\mathcal{Q}_\beta(R)}(1_R\psi w)(x)|>\lambda\} \]

for some subcollection $\mathcal{Q}_\beta(R)\subset\mathcal{P}_\beta(S)$. Let $Q_1,Q_2,\dots$ be the maximal subcubes $Q\subset R$ in $\mathcal{D}_\kappa$
satisfying

^ 0Q.(x) > A. 

Q'&Vf3{S): Q<ZQ'CR 

Note that every point in {x G i? : > A} is contained in exactly one Qj and, in fact, 

{x e R: 0K,/3(x) > A} = Uj Qj- Then let 

00 

Qp{R) ■■= [j{Q e Vp{S) : c g c R}. 

Now suppose 4>*jib{x) > A, and let Qj be the unique cube containing x. Then, by definition. 



A < 



E 

QeVaiS): QjZQGR 

E 



X] 



Qn{e{y)<\Q\<v{y)} 



SQ{y,x)ls{y)^{y)w{y) dy 



Since every cube in the sum is a subset of $R$, we may replace $1_S$ by $1_R$ above. The result is
precisely $|L^*_{\mathcal{Q}_\beta(R)}(1_R\psi w)(x)|$.

Let $K\ge1$ be the dimensional constant of Theorem 9.3 for the weak-type inequality (9.5)
for shifts with scales separated. Recalling that this separation is satisfied in our situation,
we have in particular that

\[ |\{|L^*_{\mathcal{Q}'}f|>\lambda\}| \le \frac{K}{\lambda}\|f\|_1,\qquad\lambda>0, \]

for any collection $\mathcal{Q}'\subset\mathcal{Q}$. Choosing $\lambda = 2\cdot|S|^{-1}w(S)4^{-\beta+1}K$ now yields

\[
|\{x\in R : \phi^*_{R,\beta}(x)>\lambda\}| \le |\{x\in R : |L^*_{\mathcal{Q}_\beta(R)}(1_R\psi w)(x)|>\lambda\}|
\le \frac{K|S|\,\|1_Rw\|_1}{2\cdot w(S)4^{-\beta+1}K} = \frac{|S|\cdot w(R)}{2\cdot w(S)4^{-\beta+1}}.
\]

If $R\in\mathcal{P}_\beta(S)$, we immediately obtain $|\{x\in R : \phi^*_{R,\beta}(x)>\lambda\}|\le|R|/2$. However, Lemma
10.2 requires the same estimate for all $R\in\mathcal{D}_\kappa$. Let $R_1,R_2,\dots$ be the maximal cubes of $\mathcal{P}_\beta(S)$
inside $R$. Then $\phi^*_{R,\beta}$ is supported on the disjoint union of the cubes $R_j$, and $\phi^*_{R,\beta}(x) = \phi^*_{R_j,\beta}(x)$
for $x\in R_j$. Hence

\[
|\{x\in R : \lambda^{-1}\phi^*_{R,\beta}(x)>1\}| = \sum_{j=1}^\infty|\{x\in R_j : \phi^*_{R_j,\beta}(x)>\lambda\}| \le \sum_{j=1}^\infty\frac{|R_j|}{2} \le \frac{|R|}{2}.
\]

Since $\lambda\ge|S|^{-1}w(S)4^{-\beta+1}$, we still have $\|\lambda^{-1}\phi_Q\|_\infty\le1$, so that finally the functions $\lambda^{-1}\phi_Q$,
and the related $\lambda^{-1}\phi^*_{R,\beta}$, satisfy all requirements of Lemma 10.2. We conclude that

\[ |\{x\in S : \phi^*_{S,\beta}(x)>t\lambda\}| \le 2^{-(t-1)/2}|S|,\qquad t>1. \]

Note that the inequality is trivial for $0<t\le1$, so that it holds for all $t>0$. Recalling the
definition of $\lambda$ and rescaling $t$ we can rewrite the previous as

\[ \Big|\Big\{x\in S : \phi^*_{S,\beta}(x)>16t\,\frac{w(S)}{|S|}\Big\}\Big| \le \sqrt2\cdot2^{-t4^\beta/K}|S|,\qquad t>0. \tag{10.14} \]



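To finish the proof of (10.12), we use $\phi^*_S\le\sum_\beta\phi^*_{S,\beta}$ to estimate

\[
\begin{aligned}
\Big|\Big\{x\in S : \phi^*_S(x)>16t\,\frac{w(S)}{|S|}\Big\}\Big|
&\le \sum_{\beta=0}^\infty\Big|\Big\{x\in S : \phi^*_{S,\beta}(x)>16\cdot2^{-\beta-1}t\,\frac{w(S)}{|S|}\Big\}\Big|\\
&\le \sqrt2\sum_{\beta=0}^\infty2^{-t2^{\beta}/2K}|S| \overset{(*)}{\le} \sqrt2\sum_{\beta=0}^\infty2^{-t/2K-\beta}|S| = 2\sqrt2\cdot2^{-t/2K}|S|.
\end{aligned}
\]

The inequality (*) only holds in case $t\ge2K$, but for $t<2K$ the inequality just obtained is
trivial. Replacing $t$ by $Kt$ proves (10.12).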

10.5. Proof of the distributional estimate for the $\sigma$ measure. In order to prove (10.13),
we need the assumption that our cube family is $(\sigma,\alpha,\kappa)$-adapted. Namely, recalling also the
definition of $\mathcal{P}_\beta(S)$, we have the inequalities



151 



w{S) 



< 



\Q\ J w{S) 



Thus the measure $\sigma$ is not impossibly far away from Lebesgue measure on the cubes $Q\in\mathcal{P}_\beta(S)$,
and this allows us to make use of the previously established estimate for the Lebesgue
measure. As was already noted during the proof of (10.12), any set of the form $\{x\in S : \phi^*_{S,\beta}(x)>\lambda\}$
can be expressed as the disjoint union of the maximal cubes $Q\in\mathcal{D}_\kappa$ for which
$Q\subset S$ and

\[ \Big|\sum_{Q'\in\mathcal{P}_\beta(S):\,Q\subsetneq Q'\subseteq S}\phi_{Q'}(x)\Big| > \lambda. \tag{10.15} \]

Apply this with $\lambda = 20t\cdot w(S)/|S|$, and let $Q_1,Q_2,\dots$ be the maximal cubes mentioned
above. Note that it cannot happen that $Q_j = S$, since then the summation condition in
(10.15) would be empty ($S\subsetneq Q'\subseteq S$), and the estimate (10.15) could not possibly hold.
Thus $Q_j\subsetneq S$ for every $j$, but there is no reason why $Q_j\in\mathcal{P}_\beta(S)$. Instead, $\hat Q_j\in\mathcal{P}_\beta(S)$,
where $\hat Q_j$ denotes the parent (in $\mathcal{D}_\kappa$) of $Q_j$. This follows from the maximality of $Q_j$ and the
fact that the summation in (10.15) is only over $Q'\in\mathcal{P}_\beta(S)$. Now let

\[ E_\beta(t) := \bigcup_{j=1}^\infty\hat Q_j. \]

This union may be assumed disjoint, since it is anyway the disjoint union of the maximal
cubes among the cubes $\hat Q_j\in\mathcal{P}_\beta(S)$. Recalling the estimate $|\phi_Q(x)|\le4^{-\beta+1}w(S)|S|^{-1}$ used
already in the proof of (10.12), we may now estimate



Q'<=iVp{S): Qj<ZQ'CS 



> 



> 20t 



'eVis(S): Qj<ZQ'cS 

!£05)_4.4-/3!^5l6t^ 

\s\ \s\ - \s\ ^ 



X G Qj. 



Of course, (*) only holds in case $t\ge4^{-\beta}$. One should also observe that we needed here the
fact that the sum in the first expression is constant on $\hat Q_j$, the $\mathcal{D}_\kappa$-child of the first cubes
$Q'$ in the actual summation: this becomes essential if $x\in\hat Q_j$ is not in $Q_j$ to begin with,
since we only have the inequality (10.15) for $x\in Q_j$. The previous estimate now shows that
$E_\beta(t)\subset\{x : \phi^*_{S,\beta}(x)>16t\cdot w(S)/|S|\}$ for $t\ge4^{-\beta}$, whence

\[ |E_\beta(t)| \le \sqrt2\cdot2^{-t4^\beta/K}|S| \le 2\cdot2^{-t4^\beta/K}|S| \]

by (10.14). For $t<4^{-\beta}$ this estimate says nothing, so the same bound holds for all $t>0$. Now



sum over the disjoint cubes that form i?/3(t) and use a{Q) < (2°'~^^4:^w{S) ^\S\) 



\Q\ to 



obtain 



aiEM) < f2""^4/^^)''^^"V.(t)| < f2"-^4'^^V'^^"^2.2--^/^5| 



wiS) 



wiS) 



< 



^2 . 4^)i/(p-i)^ . 2 . 2-*4''/^|^| = 2^' ■ A^/^P-'^ ■ 2-*^'/V(^). 



In the last inequality we used again the fact that the cubes in $\mathcal{P}(S)$ are $(\sigma,\alpha,\kappa)$-adapted.
The estimate for $\sigma(\{x\in S : \phi^*_S(x)>20t\cdot w(S)/|S|\})$ is now obtained in the same manner as
at the end of the proof of (10.12):



xeS: (j)*s{x) > 20t^^ 



< a 



OO f 

(S)-E I 



(10.16) 



13=0 13=0 

If $t<2K$, we clearly have

\[ \sigma\Big(\Big\{x\in S : \phi^*_S(x)>20t\,\frac{w(S)}{|S|}\Big\}\Big) \le 2\cdot2^{-t/2K}\sigma(S). \]



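We wish to prove a similar bound in case $t\ge2K$ by estimating the last series on line (10.16).
Since $2^\beta\ge(2/(p-1)+1)\beta+1$ for $\beta\ge\beta_p$, we have

\[ 2\beta/(p-1) - t2^\beta/2K \le -\beta - t/2K,\qquad\beta\ge\beta_p. \]

Thus

\[
\sum_{\beta=0}^\infty2^{2\beta/(p-1)}2^{-t2^\beta/2K} \le \sum_{\beta<\beta_p}2^{2\beta/(p-1)}2^{-t/2K} + \sum_{\beta\ge\beta_p}2^{-\beta-t/2K} \le C_p\cdot2^{-t/2K}.
\]

Hence

\[ \sigma\Big(\Big\{x\in S : \phi^*_S(x)>20t\,\frac{w(S)}{|S|}\Big\}\Big) \le C_p\cdot2^{-t/2K}\sigma(S) \]

in each case. This completes the proof of (10.13), and thus of Lemma 10.11.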

11. Verifying the Testing Conditions 

The main Theorems of §4 state that we can reduce the verification of the estimates of the
main technical Theorem 2.10 to this Lemma.

Lemma 11.1. For an $L^2$-bounded Haar shift operator $\mathbb{S}$ with complexity $\kappa$, a choice of
$1<p<\infty$, and $w\in A_p$, we have these estimates uniform over selection of dyadic cube $Q$ and
choice of linearization $L$ of the maximal truncations $\mathbb{S}_\natural$:

\[ \|1_QL^*(\psi w1_Q)\|_{L^{p'}(\sigma)} \lesssim \kappa\|w\|_{A_p}\,w(Q)^{1/p'}, \tag{11.2} \]

\[ \Big\|\sup_{R\subset Q}\frac{1_R}{w(R)}\int_R|L^*(1_R\psi w)(y)|\,\sigma(dy)\Big\|_{L^p(w)} \lesssim \kappa\|w\|_{A_p}^{1/(p-1)}\sigma(Q)^{1/p}. \tag{11.3} \]

In these estimates, $p' = p/(p-1)$ is the conjugate index, $\sigma = w^{1-p'}$, and $\psi$ is a measurable
function with $\|\psi\|_\infty\le1$.

We shall return to the proof of the Lemma above, turning here to the proofs of the two
main technical estimates in Theorem 2.10.

Proof of the weak-type estimate (2.11). We apply the inequality (4.4) from Theorem 4.3 giving
sufficient conditions for the weak-type inequality in the general two-weight case, and
then Buckley's bound for the weak-type maximal inequality constant $\mathfrak{M}_{p,\mathrm{weak}}$ and the bound
(11.2) for the 'backward' testing constant $\mathfrak{T}_p$ in the one-weight case:

\[
\|\mathbb{S}_\natural(f\sigma)\|_{L^{p,\infty}(w)} \lesssim (\mathfrak{M}_{p,\mathrm{weak}} + \mathfrak{T}_p)\|f\|_{L^p(\sigma)}
\lesssim \big(\|w\|_{A_p}^{1/p} + \kappa\|w\|_{A_p}\big)\|f\|_{L^p(\sigma)} \lesssim \kappa\|w\|_{A_p}\|f\|_{L^p(\sigma)},
\]

noting that the second term dominates since $\|w\|_{A_p}\ge1$. □

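Proof of the strong-type estimate (2.12). We apply the sufficient conditions in the two-weight
setting of Theorem 4.7, and then the available estimates for the testing constants in the one-weight
situation: Buckley's bound for the strong-type maximal inequality constant $\mathfrak{M}_p$ and
the bounds (11.2) and (11.3) for the 'backward' and nonstandard testing constants $\mathfrak{T}_p$ and $\mathfrak{N}_p$:

\[
\|\mathbb{S}_\natural(f\sigma)\|_{L^p(w)} \lesssim (\mathfrak{M}_p + \mathfrak{T}_p + \mathfrak{N}_p)\|f\|_{L^p(\sigma)}
\lesssim \big(\|w\|_{A_p}^{1/(p-1)} + \kappa\|w\|_{A_p} + \kappa\|w\|_{A_p}^{1/(p-1)}\big)\|f\|_{L^p(\sigma)}.
\]

In this case either the first and third, or the second term dominates, depending on whether
$p<2$ or $p>2$. □

We turn to the proof of Lemma 11.1.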

Proof of (11.2). The data is fixed: a choice of index $p$, weight $w\in A_p$, cube $Q$, function $\psi$ bounded
by 1, and Haar shift $\mathbb{S}$ of complexity $\kappa$.

The definition of $L^*$ is as in (3.4), and is in particular a sum over all dyadic cubes $P$. Now,
the sum in (3.4) can obviously be restricted to a sum over $P$ that intersect $Q$. Moreover,
the sum over $P$ that contain $Q$ can be controlled, using a straightforward appeal to the size
conditions on the Haar shift operators, and the definition of $A_p$.

Thus, in what follows, we need only consider cubes which are contained in $Q$. We split
these cubes into $(\sigma,\alpha,\kappa)$-adapted subcollections, in the sense of Definition 10.9. Namely, we
form the $\kappa+1$ subcollections according to the value of $\log_2\ell(Q') \bmod (\kappa+1)$. And each of
them is further split according to the unique number $\alpha$ such that

\[ 2^{\alpha} \le \frac{w(Q')}{|Q'|}\Big(\frac{\sigma(Q')}{|Q'|}\Big)^{p-1} < 2^{\alpha+1}, \]

where $0\le\alpha\le\lceil\log_2\|w\|_{A_p}\rceil$ is an integer.

For any one of these $(\sigma,\alpha,\kappa)$-adapted subcollections, say $\mathcal{Q}$, apply Definition 10.3
to get $w$-stopping cubes $\mathcal{S}$, and corona decomposition $\{\mathcal{P}(S) : S\in\mathcal{S}\}$. Define functions
$F_S := L^*_{\mathcal{P}(S)}(w\psi1_S)$. For these functions, we have the distributional estimate (10.13), and it
remains to show that

\[ \Big\|\sum_{S\in\mathcal{S}}F_S\Big\|_{L^{p'}(\sigma)} \lesssim 2^{\alpha/p}\|w\|_{A_p}^{1/p'}w(Q)^{1/p'}. \tag{11.4} \]

To conclude (11.2), we sum this estimate over $\alpha$ to get the upper bound $\|w\|_{A_p}w(Q)^{1/p'}$, and
then over the $\kappa+1$ possible values of $\log_2\ell(Q') \bmod (\kappa+1)$ to get the estimate (11.2) as
stated, with the factor $\kappa$.
The distributional estimates are exponential in nature, which permits a facile estimate of
this norm. Define level sets of the functions $F_S$ by

\[ F_{S,0} := \Big\{|F_S|\le K\frac{w(S)}{|S|}\Big\}, \]

\[ F_{S,n} := \Big\{Kn\frac{w(S)}{|S|} < |F_S|\le K(n+1)\frac{w(S)}{|S|}\Big\},\qquad n\ge1, \]

where $K$ is the dimensional constant in (10.13). And then we estimate, by a familiar trick,



Ses n=o ses 



2 °^ 

n=0 ses 



n=0 ses ' ' 



1^1 



-1 n' 

1 



n=0 S£S 

Here, we use Hölder's inequality in the first inequality, and in the fourth, we use the fact that
the averages $w(S)/|S|$ increase geometrically along the stopping cubes. This is wasteful in the
powers of $n$, which is of no consequence for us.

We then prove (11.4) as follows, absorbing the dimensional constant $K$ into the constants
implicit in the notation $\lesssim$. We have

n=0 ses ' ' 

w{S)ip'-^(x{S) 



— L|^|J \S\ 



\[ \lesssim 2^{(p'-1)\alpha}\sum_{S\in\mathcal{S}}w(S) \lesssim 2^{(p'-1)\alpha}\|w\|_{A_p}w(Q). \tag{11.5} \]

To pass to the second line, we use the distributional estimate (10.13), giving us a sum in $n$
that is trivially bounded. From the third to the fourth line, we use the condition (10.10) of
Definition 10.9 and then use the estimate (10.8). □



Proof of (11.3). Fix the relevant data for this condition, and take a dyadic cube $R\subset Q$. We
take $L^*_R$ to be as in (3.4), but with the sum restricted to cubes that contain $R$. Then,

\[
\sup_{R\subset Q}\frac{1_R}{w(R)}\int_R|L^*_R(w\psi1_R)|\,\sigma(dy) \lesssim \sup_{R\subset Q}1_R\frac{\sigma(R)}{|R|} \le M(\sigma1_Q).
\]

And this last term is controlled by Buckley's estimate for the Maximal Function. Thus, when
we consider the expression

\[ \frac{1}{w(R)}\int_R|L^*(1_R\psi w)(y)|\,\sigma(dy) \]

we can in addition assume that all cubes contributing to $L^*$ are contained in $R$, namely we
can replace $L^*$ by $L^*_{\mathcal{R}(R)}$, where $\mathcal{R}(R)$ is an appropriate collection of cubes contained in $R$.
To make a relevant estimate of this integral, we can consider $\mathcal{Q}$ a collection of cubes $Q'\subset Q$
that are $(w,\alpha,\kappa)$-adapted, where $0\le\alpha\le\lceil\log_2\|w\|_{A_p}\rceil$. Then, we will show that



(11.6) 



supli?— — / \'L*T^^R){lRipw){y)\a{dy) 

RCQ w[R) Jr 



LP(w) 



where the collections $\mathcal{R}(R)$ are those cubes $Q'\in\mathcal{Q}$ with $Q'\subset R$. Summing this estimate
over $0\le\alpha\le\lceil\log_2\|w\|_{A_p}\rceil$ and the $\kappa+1$ possible values of $\log_2\ell(Q') \bmod (\kappa+1)$ will prove
(11.3).

We then apply Definition 10.3, letting $\mathcal{S}(R)$ be a collection of stopping cubes for $\mathcal{R}(R)$,
with Corona Decomposition $\{\mathcal{P}(S) : S\in\mathcal{S}(R)\}$. Then, we have



w 



seS(R) ' 



< K 



w{R) 



S&S{R) 



\S\ 
w{S) 



i/(p-i) 



Here, we use the estimate (10.13). Recalling that $K$ is only a dimensional constant, we
henceforth absorb it into the notation. And so, to prove (11.6), we should show that



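\[
\Bigl\| \sup_{R\subset Q} \frac{\mathbf 1_R}{w(R)} \sum_{S\in\mathcal S(R)} w(S) \Bigl[\frac{|S|}{w(S)}\Bigr]^{1/(p-1)} \Bigr\|_{L^p(w)} \lesssim \sigma(Q)^{1/p}. \tag{11.7}
\]
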
We estimate 

\begin{align*}
\sum_{S\in\mathcal S(R)} w(S) \Bigl[\frac{|S|}{w(S)}\Bigr]^{1/(p-1)}
&= \sum_{k=0}^{\infty} \sum_{S\in\mathcal S_k(R)} w(S) \Bigl[\frac{|S|}{w(S)}\Bigr]^{1/(p-1)} \\
&\lesssim \cdots \lesssim w(R) \Bigl[\frac{|R|}{w(R)}\Bigr]^{1/(p-1)} \\
&= w(R) \bigl[\mathbb E^{w}_R(w^{-1}\mathbf 1_Q)\bigr]^{1/(p-1)},
\end{align*}

where we used the fact that the terms $\frac{|S|}{w(S)}$ decrease geometrically along the levels of the 
stopping cubes. Therefore, to see the estimate (11.7), we should estimate 



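\[
\Bigl\| \sup_{R\subset Q} \mathbf 1_R \bigl[\mathbb E^{w}_R(w^{-1}\mathbf 1_Q)\bigr]^{1/(p-1)} \Bigr\|_{L^p(w)}
\]

by the universal maximal function estimate. □ 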
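
The appeal to the universal maximal function estimate can be written out as a short computation. This is only a sketch under assumptions that are not restated at this point of the text: $M_w$ is taken to be the dyadic maximal function with respect to the measure $w$, the quantity to be estimated is taken to be $\|[M_w(w^{-1}\mathbf 1_Q)]^{1/(p-1)}\|_{L^p(w)}$, and $\sigma = w^{1-p'}$. The universal estimate is used in the form $\|M_w g\|_{L^q(w)} \le q'\|g\|_{L^q(w)}$, with a constant independent of $w$, applied with $q = p'$ (so that $q' = p$).

```latex
\begin{align*}
\bigl\| [M_w(w^{-1}\mathbf 1_Q)]^{1/(p-1)} \bigr\|_{L^p(w)}^{p}
  &= \int \bigl[M_w(w^{-1}\mathbf 1_Q)\bigr]^{p/(p-1)}\, w\, dx
   = \bigl\| M_w(w^{-1}\mathbf 1_Q) \bigr\|_{L^{p'}(w)}^{p'} \\
  % universal (Doob) maximal estimate on L^{p'}(w), constant (p')' = p
  &\le p^{p'}\, \bigl\| w^{-1}\mathbf 1_Q \bigr\|_{L^{p'}(w)}^{p'}
   = p^{p'} \int_Q w^{-p'}\, w\, dx \\
  &= p^{p'} \int_Q w^{1-p'}\, dx
   = p^{p'}\, \sigma(Q).
\end{align*}
```

Taking $p$-th roots gives a bound by a constant multiple of $\sigma(Q)^{1/p}$, with no dependence on the $A_p$ characteristic of $w$, which is the feature that makes the estimate "universal".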



12. Variations on the Main Theorem 



In this final section, we present a couple of variations of the main result, Theorem 1.2, 
where we allow the appearance of different weight characteristics than just $\|w\|_{A_p}$ in the norm 
estimates in $L^p(w)$. First, as a direct consequence of Theorem 1.2, we deduce the following 
result, which was conjectured by Lerner and Ombrosi [19, Conjecture 1.3] for untruncated 
operators $T$: 

Corollary 12.1. For $T$ an $L^2(\mathbb R^d)$ bounded Calderón-Zygmund Operator and $1 < q < p < \infty$, 

\[
\|T_\natural f\|_{L^p(w)} \le C_{T,p,q}\, \|w\|_{A_q}\, \|f\|_{L^p(w)}.
\]

Proof. Choose some $r \in (p, \infty)$. Since $\|w\|_{A_r} \le \|w\|_{A_q}$, the weak-type estimate of Theorem 1.2 
shows that 

\begin{align*}
\|T_\natural f\|_{L^{q,\infty}(w)} &\le C_{T,q}\, \|w\|_{A_q}\, \|f\|_{L^q(w)}, \\
\|T_\natural f\|_{L^{r,\infty}(w)} &\le C_{T,r}\, \|w\|_{A_r}\, \|f\|_{L^r(w)} \le C_{T,r}\, \|w\|_{A_q}\, \|f\|_{L^r(w)}.
\end{align*}

It suffices to apply the Marcinkiewicz interpolation theorem to the sublinear operator $T_\natural$ to 
deduce the asserted strong-type bound in $L^p(w)$. (For $p \ge 2$, we could also have used the 
strong-type estimate of Theorem 1.2 and $\|w\|_{A_p} \le \|w\|_{A_q}$, without any interpolation.) □ 
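
The interpolation step in the proof of Corollary 12.1 can be spelled out as follows. This is a sketch: the weak-type norms $N_q$, $N_r$, the exponent $\theta$ and the constant $C_{p,q,r}$ are notation introduced here for illustration, and the form of the Marcinkiewicz theorem used is the standard one for a sublinear operator of weak types $(q,q)$ and $(r,r)$ with respect to the measure $w$.

```latex
% N_q, N_r: weak-type norms of T_\natural on L^q(w) and L^r(w) (notation assumed here)
\[
N_q \le C_{T,q}\, \|w\|_{A_q}, \qquad N_r \le C_{T,r}\, \|w\|_{A_q}.
\]
% Marcinkiewicz for q < p < r, with \theta \in (0,1) fixed by 1/p = (1-\theta)/q + \theta/r
\begin{align*}
\|T_\natural f\|_{L^p(w)}
  &\le C_{p,q,r}\, N_q^{1-\theta} N_r^{\theta}\, \|f\|_{L^p(w)} \\
  &\le C_{p,q,r}\, C_{T,q}^{1-\theta} C_{T,r}^{\theta}\, \|w\|_{A_q}^{(1-\theta)+\theta}\, \|f\|_{L^p(w)}
   = C_{T,p,q}\, \|w\|_{A_q}\, \|f\|_{L^p(w)}.
\end{align*}
```

The point of the computation is that both endpoint constants carry the same characteristic $\|w\|_{A_q}$ to the first power, so the interpolated bound is again linear in $\|w\|_{A_q}$. Fixing $r$ as a function of $p$ (for instance $r = 2p$) makes the final constant depend only on $T$, $p$ and $q$.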

In order to describe the other variant of the main theorem, we consider the $A_\infty$ characteristic 

\[
\|w\|_{A_\infty} := \sup_Q \frac{1}{w(Q)} \int_Q M(w\mathbf 1_Q), \tag{12.2}
\]

which has been introduced (with a different notation) by Wilson [26], and more recently 
used by Lerner [17]. One can show (see [10]) that $\|w\|_{A_\infty} \le c_d \|w\|_{A_p}$ for all $p \in (1,\infty)$, and 
therefore the following estimate is an improvement of Theorem 1.2: 

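Theorem 12.3. For $T$ an $L^2(\mathbb R^d)$ bounded Calderón-Zygmund Operator and $1 < p < \infty$, 

\begin{align*}
\|T_\natural f\|_{L^{p,\infty}(w)} &\le C_{T,p}\, \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'}\, \|f\|_{L^p(w)}, \\
\|T_\natural f\|_{L^p(w)} &\le C_{T,p} \bigl( \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'} + \|w\|_{A_p}^{1/(p-1)} \bigr) \|f\|_{L^p(w)}.
\end{align*}

Note that the strong-type bound is a strict improvement of the corresponding bound in 
Theorem 1.2 only for $2 < p < \infty$. For $1 < p \le 2$, the term $\|w\|_{A_p}^{1/(p-1)}$ dominates, and this is 
exactly the same bound as in Theorem 1.2. 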
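
The comparison with Theorem 1.2 can be checked by a short calculation. The following is a sketch that uses only the inequality $\|w\|_{A_\infty} \le c_d \|w\|_{A_p}$ quoted above together with the elementary facts $\|w\|_{A_p} \ge 1$ and $\|w\|_{A_\infty} \ge 1$; it treats a strong-type bound of the form $A_p^{1/p} A_\infty^{1/p'} + A_p^{1/(p-1)}$, where $A_p$ and $A_\infty$ are shorthand introduced here.

```latex
% Shorthand (assumed here): A_p = \|w\|_{A_p}, A_\infty = \|w\|_{A_\infty}, with 1 \le A_\infty \le c_d A_p.
\[
A_p^{1/p} A_\infty^{1/p'} \le c_d^{1/p'} A_p^{1/p + 1/p'} = c_d^{1/p'} A_p .
\]
% Case 1 < p \le 2: here 1/(p-1) \ge 1, so A_p \le A_p^{1/(p-1)} and the second term dominates.
\[
A_p^{1/(p-1)} \le A_p^{1/p} A_\infty^{1/p'} + A_p^{1/(p-1)} \le \bigl(c_d^{1/p'} + 1\bigr) A_p^{1/(p-1)} .
\]
% Case 2 < p < \infty: here 1/(p-1) < 1, so A_p^{1/(p-1)} \le A_p.
\[
A_p^{1/p} A_\infty^{1/p'} + A_p^{1/(p-1)} \le \bigl(c_d^{1/p'} + 1\bigr) A_p ,
\]
% with a genuine gain over A_p whenever A_\infty is much smaller than A_p.
```

So for $1 < p \le 2$ the sum is comparable to $\|w\|_{A_p}^{1/(p-1)}$ and nothing is gained, while for $p > 2$ the sum never exceeds a constant multiple of $\|w\|_{A_p}$ and is strictly smaller for weights whose $A_\infty$ characteristic is much smaller than their $A_p$ characteristic.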

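An earlier result of this type is implicitly contained in the work of Lerner [17, Section 5.5]: 

\[
\|T_\natural f\|_{L^p(w)} \le C_{T,p}\, \|w\|_{A_p}^{1/2} \|w\|_{A_\infty}^{1/2}\, \|f\|_{L^p(w)}, \qquad 3 \le p < \infty.
\]

Our estimate improves on this, as both terms $\|w\|_{A_p}^{1/(p-1)} \le \|w\|_{A_p}^{1/2}$ (for $p - 1 \ge 2$) and 

\[
\|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'} = \|w\|_{A_p}^{1/2} \|w\|_{A_\infty}^{1/2} \Bigl( \frac{\|w\|_{A_\infty}}{\|w\|_{A_p}} \Bigr)^{1/2 - 1/p}
\]

are smaller than $\|w\|_{A_p}^{1/2} \|w\|_{A_\infty}^{1/2}$ for $3 \le p < \infty$. 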

On the other hand, Theorem 12.3 fails to reproduce the following bound for untruncated 
operators $T$, which was recently obtained in [10]: 

\[
\|Tf\|_{L^2(w)} \lesssim \|w\|_{A_2}^{1/2} \bigl( \|w\|_{A_\infty} + \|w^{-1}\|_{A_\infty} \bigr)^{1/2} \|f\|_{L^2(w)}.
\]

We do not know whether this bound remains true for $T_\natural$ in place of $T$. More generally, 
although the optimal $L^p(w)$ bounds in terms of $\|w\|_{A_p}$ coincide for $T$ and $T_\natural$, we do not 
know if this is the case when allowing the more complicated dependence on both $A_p$ and 
$A_\infty$ characteristics. 

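We now discuss the proof of Theorem 12.3. Just like the proof of Theorem 1.2, thanks to 
the Representation Theorem 2.5, this is a consequence of an analogous result for Haar shifts: 

Theorem 12.4. Let $\mathbb S$ be a Haar shift operator with complexity $\kappa$, a paraproduct, or a dual 
paraproduct. For $1 < p < \infty$ and $w \in A_p$, we have the estimates 

\begin{align*}
\|\mathbb S_\natural f\|_{L^{p,\infty}(w)} &\lesssim \kappa\, \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'}\, \|f\|_{L^p(w)}, \\
\|\mathbb S_\natural f\|_{L^p(w)} &\lesssim \kappa \bigl( \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'} + \|w\|_{A_p}^{1/(p-1)} \bigr) \|f\|_{L^p(w)}.
\end{align*}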

Sketch of proof. We can exploit the same two-weight estimates as before, 
and it only remains to see how the $A_\infty$ norm can be incorporated into the estimates of the 
testing constants appearing in them. Recall that we have 

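\[
\mathfrak M_{p,\mathrm{weak}} \lesssim \|w\|_{A_p}^{1/p}, \qquad
\mathfrak M_p \lesssim \|w\|_{A_p}^{1/(p-1)}, \qquad
\mathfrak N_p \lesssim \kappa\, \|w\|_{A_p}^{1/(p-1)},
\]

which are of admissible size for Theorem 12.4. The improvement over the earlier bounds 
comes from a more careful estimation of $\mathfrak T_p$. 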

Recall that the estimate for $\mathfrak T_p$ is proven in Lemma 11.1, Eq. (11.2), and that this 
proof is reduced to the estimate (11.4). The final step in the proof of this estimate, (11.5), 
is an application of Lemma 10.7, which says that the collection $\mathcal S$ of $w$-stopping cubes for a 
cube $Q$ satisfies 

\[
\sum_{S\in\mathcal S} w(S) \lesssim \|w\|_{A_p}\, w(Q).
\]

It is this last bound where $\|w\|_{A_p}$ can be replaced by $\|w\|_{A_\infty}$, as observed in [10]: 

\[
\sum_{S\in\mathcal S} w(S) \lesssim \|w\|_{A_\infty}\, w(Q).
\]

Using this bound in (11.5), we obtain 

\[
\mathfrak T_p \lesssim \kappa\, \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'}
\]

and thus the estimates as asserted in Theorem 12.4. □ 
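
The exponents $1/p$ and $1/p'$ in this bound can be traced through a short calculation. The following is only a sketch, under assumptions that are not restated in this section: that the testing constant is read off from (11.5) by taking $p'$-th roots, and that the resulting bounds are then summed over the indices $0 \le a < \lceil \log_2 \|w\|_{A_p} \rceil$ used in Section 11. The factor of the complexity is left out of the computation.

```latex
% Last line of (11.5), with the A_\infty bound for the stopping cubes in place of the A_p bound:
\[
2^{(p'-1)a} \sum_{S\in\mathcal S} w(S) \lesssim 2^{(p'-1)a}\, \|w\|_{A_\infty}\, w(Q).
\]
% Taking p'-th roots (assumed normalisation of the testing condition), using (p'-1)/p' = 1/p:
\[
\bigl( 2^{(p'-1)a}\, \|w\|_{A_\infty} \bigr)^{1/p'} = 2^{a/p}\, \|w\|_{A_\infty}^{1/p'}.
\]
% Summing the geometric series over 0 \le a < N, where N = \lceil \log_2 \|w\|_{A_p} \rceil \le \log_2 \|w\|_{A_p} + 1:
\[
\sum_{a=0}^{N-1} 2^{a/p} \le \frac{2^{N/p}}{2^{1/p} - 1} \le \frac{2^{1/p}}{2^{1/p} - 1}\, \|w\|_{A_p}^{1/p},
\qquad\text{so}\qquad
\sum_{a=0}^{N-1} 2^{a/p}\, \|w\|_{A_\infty}^{1/p'} \lesssim \|w\|_{A_p}^{1/p} \|w\|_{A_\infty}^{1/p'} .
\]
```

Running the same computation with $\|w\|_{A_p}$ in place of $\|w\|_{A_\infty}$ gives $\|w\|_{A_p}^{1/p + 1/p'} = \|w\|_{A_p}$, which is consistent with the linear dependence in the weak-type estimate of Theorem 1.2.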

Remark 12.5. It is shown in [10] that there is the sharper bound $\mathfrak M_p \lesssim (\|w\|_{A_p} \|\sigma\|_{A_\infty})^{1/p}$, 
where $\sigma = w^{-1/(p-1)}$. However, this does not allow us to improve on the above estimates, as our 
bound for $\mathfrak N_p$, namely $\|w\|_{A_p}^{1/(p-1)}$, is already as large as Buckley's classical bound for $\mathfrak M_p$. 

Remark 12.6. Another, perhaps more commonly used definition of the $A_\infty$ characteristic is 
the following quantity introduced by Hruščev [5]: 

\[
\sup_Q \Bigl( \frac{1}{|Q|} \int_Q w \Bigr) \exp\Bigl( \frac{1}{|Q|} \int_Q \log w^{-1} \Bigr).
\]

The $A_\infty$ characteristic as defined in (12.2) is always smaller (up to a dimensional constant), 
and it can be much smaller for some weights (see [10] for details), so Theorems 12.3 and 12.4 
are sharper when stated in terms of $\|w\|_{A_\infty}$ in (12.2). 